Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2016 Jan 27;36(4):1056–1070. doi: 10.1523/JNEUROSCI.1921-15.2016

Why Don't We Move Slower? The Value of Time in the Neural Control of Action

Bastien Berret 1,, Frédéric Jean 2
PMCID: PMC6604817  PMID: 26818497

Abstract

To want something now rather than later is a common attitude that reflects the brain's tendency to value the passage of time. Because the time taken to accomplish an action inevitably delays task achievement and reward acquisition, this idea was ported to neural movement control within the “cost of time” theory. This theory provides a normative framework to account for the underpinnings of movement time formation within the brain and the origin of a self-selected pace in human and animal motion. Then, how does the brain exactly value time in the control of action? To tackle this issue, we used an inverse optimal control approach and developed a general methodology allowing to squarely sample infinitesimal values of the time cost from experimental motion data. The cost of time underlying saccades was found to have a concave growth, thereby confirming previous results on hyperbolic reward discounting, yet without making any prior assumption about this hypothetical nature. For self-paced reaching, however, movement time was primarily valued according to a striking sigmoidal shape; its rate of change consistently presented a steep rise before a maximum was reached and a slower decay was observed. Theoretical properties of uniqueness and robustness of the inferred time cost were established for the class of problems under investigation, thus reinforcing the significance of the present findings. These results may offer a unique opportunity to uncover how the brain values the passage of time in healthy and pathological motor control and shed new light on the processes underlying action invigoration.

SIGNIFICANCE STATEMENT Movement time is a fundamental characteristic of neural motor control, but the principles underlying its formation remain little known. This work addresses that question within the inverse optimal control framework where the challenge is to uncover what optimality criterion underlies a system's behavior. Here we rely on the “cost of time” theory that finds its roots into the brain's tendency to discount the actual value of future reward. It asserts that the time elapsed until action completion entails a cost, thereby making slow moves nonoptimal. By means of a thorough theoretical analysis, the present article shows how to sample the infinitesimal values of the time cost without prior assumption about its hypothetical nature and emphasizes its sigmoidal shape for reaching.

Keywords: action vigor, cost of time, movement time, optimal control, reaching, saccade

Introduction

Movement time (MT) is an inherent characteristic of motor control, but the profound principles underlying its formation, be they neural or computational, remain in practice little understood. Prediction of MT is, however, a critical issue in all fields concerned with biological movement, such as humanoid/rehabilitation robotics, neuroprosthetics, brain-machine interfaces, or computer animation. In neuroscience, investigating the underpinnings of one's own motion pace is of crucial importance as many motor disorders, such as Parkinson's disease, lead to bradykinesia (Berardelli et al., 2001). This symptomatic movement slowness, not reducible to musculoskeletal or sensorimotor deficits, seems to be due to a lack of implicit “motor motivation” (Mazzoni et al., 2007; Baraduc et al., 2013). Such a reduced action vigor mainly originates from a dysfunction of the basal ganglia (BG) and the dopamine system (Desmurget and Turner, 2010; Turner and Desmurget, 2010; Brücke et al., 2012), thereby showing that MT is not just an emergent property of the task but a planned quantity. Although it is well understood why movements cannot be too fast, as exemplified by the speed/accuracy trade-off (Fitts, 1954; Harris and Wolpert, 1998; Tanaka et al., 2006), few studies actually addressed the complementary question of why movements are not slower.

Recently, the theory of the cost of time (CoT) has emerged as a promising avenue to tackle the issue of action invigoration (Shadmehr, 2010; Shadmehr et al., 2010; Shadmehr and Mussa-Ivaldi, 2012). This theory was originally motivated by the natural tendency of the brain to decrease the actual value of future reward (Myerson and Green, 1995). Porting this psycho-economical decision making to motor control (Wolpert and Landy, 2012), Shadmehr and colleagues proposed that the purpose of any action would be to put the neural system in a more rewarding state. Therefore, the slower the movement, the smaller the acquired reward. In other words, slow movements may be undesirable because the very passage of time entails a cost per se, thus placing the problem within the normative framework of optimal control (OC) theory (Scott, 2004; Todorov, 2004). The generic concept of a CoT revealed itself sufficient to account for MT in a variety of motor tasks (Hoff, 1994; Liu and Todorov, 2007; Shadmehr et al., 2010; Shadmehr, 2010; Rigoux and Guigon, 2012). The actual shape of the time cost remains, however, rather elusive in neural motor control because it was chosen a priori in most existing investigations. Researchers have assumed linear (Hoff, 1994; Harris and Wolpert, 2006; Liu and Todorov, 2007), concave (hyperbolic, Shadmehr, 2010; Haith et al., 2012; or exponential, Huh et al., 2010; Rigoux and Guigon, 2012) and even convex (quadratic, Shadmehr et al., 2010) shapes for the CoT (Fig. 1a). This choice could be motivated by psychological/behavioral observations, imposed by the mathematical constraints of the problem's formulation or just guided by the researcher's intuition.

Figure 1.

Figure 1.

Illustration of the CoT theory. a, Previously proposed shapes of the CoT. Linear: G(t) = α t (Hoff, 1994; Liu and Todorov, 2007); hyperbolic: G(t) = α(1−1/(1+βt)) (Shadmehr, 2010; Shadmehr et al., 2010; Shadmehr and Mussa-Ivaldi, 2012); exponential: G(t) = α(1 − exp(− βt)) (Rigoux and Guigon, 2012); quadratic: G(t) = αt2 (Shadmehr et al., 2010). b, The CoT 0tl(x(s),u(s))ds (assumed to be linear here for simplicity) sums with the trajectory cost 0tl(x(s),u(s))ds, yielding a total cost 0th(x(s),u(s),s)ds that exhibits a “U” shape, the minimum being the optimal MT. It is noteworthy that, without the CoT, infinitely slow movements would be optimal for similar trajectory costs (which includes most of the costs proposed in the literature).

In the present study, we instead used an inverse OC approach where the goal is to automatically uncover what optimality criterion underlies a (presumably optimal) system's behavior. To this aim, we developed a complete and robust methodology for characterizing how the brain values time from basic experimental motion data. The hyperbolic nature of the temporal discounting of reward (Shadmehr et al., 2010; Haith et al., 2012) was reaffirmed for saccades but without making any prior assumption about its hypothetical nature. When applied to reaching, a striking sigmoidal shape of the CoT was discovered over the time interval of actual reach durations, thereby ruling out purely concave or convex time costs for limb movement control. These methodological and empirical breakthroughs may offer a unique opportunity to assess how the brain values time in neural movement control across tasks, individuals, or species.

Materials and Methods

Theoretical analysis

Working hypotheses.

This work relies upon the general assumption that biological movement is optimal with respect to some cost function (Engelbrecht, 2001; Todorov, 2004). It generally supposes that the trajectories triggered by the CNS can be accounted for by a certain infinitesimal cost h(x, u, t) depending on the system state, the motor command, and the time, respectively. Originally, models of OC for biological movement were developed in fixed time where MT was typically taken from experimental measurements. This restriction can be released in free time OC whereby MT emerges from the optimality of behavior (Pontryagin et al., 1964; Kirk, 1970). In this context and in agreement with the CoT theory, we further assume that h can be separated into a term that values time only, g(t) (the infinitesimal CoT) and a term that depends on the state/control variables, l(x, u) (Fig. 1b). A mathematical treatment of that problem, given below, shows that it is actually possible to compute g(t) by resolving an OC problem in fixed time t with known initial/final states, given a system dynamics and trajectory cost. When the dynamics is linear with a single control variable and the trajectory cost is quadratic (i.e., a single-input linear quadratic [LQ] problem), there moreover exist mathematical properties of uniqueness and robustness for the CoT, guaranteeing that the CoT can be identified unequivocally for that class of problem. Our main hypothesis is thus about the additive separability of temporal and trajectory costs, which is consistent with the computational motor control literature and coherent with all the previous works devised in fixed time (see below for a justification of this assertion). An alternative formulation, often seen in reinforcement learning, may have assumed multiplicative separability (e.g., exponential discounting in an infinite-horizon setting) (Sutton and Barto, 1998). Although we do not have theoretical results in the latter case, Rigoux and Guigon (2012) showed that, under certain restrictive assumptions (e.g., reward acquired on a single time step and state), a similar free time additive setting can be recovered. However, assuming a multiplicative temporal cost would be a major change of paradigm with respect to the motor control literature as the presence of such a term could modify all the predictions made by classical fixed-time optimal control models.

General methodology for identifying the CoT.

Consider a system dynamics ẋ = f(x, u), with state x ∈ ℝn and control u ∈ ℝm, and fix a target xf (f stands for final throughout the text). A dot above a variable stands for its time derivative. Given an input u(·) defined on an interval [0, tu], we denote by xu(·) the trajectory of ẋ(t) = f(x(t), u(t)) arriving at xf, i.e., satisfying xu(tu) = xf. We assume that the cost function writes as the combination of the CoT plus a trajectory cost, as follows:

graphic file with name zns00416-8031-m01.jpg

where the functions g and l are non-negative. The function l is classical to motor control and may capture various aspects of the trajectory, such as effort, energy, smoothness, or accuracy. It has been the subject of extensive investigations (e.g., Berret et al., 2011). It will be referred to as the trajectory cost throughout the study. What we know about l is that it must account for the system trajectories in fixed time (i.e., when MT is known). To get persuaded of this, it suffices to realize that a model predicting MT accurately would be meaningless if it does not predict the correct system trajectories in fixed time as well. In what follows, we shall assume that l is either known from the literature or can be identified via some inverse OC procedure performed in fixed time (e.g., Mombaur et al., 2010; Berret et al., 2011). The function g is the infinitesimal (i.e., instantaneous) CoT we are looking for, whose antiderivative is the actual CoT and will be denoted by G(t) − G(0) = 0tg(s)ds (in the sequel, we will assume that G(0) = 0).

We consider the following free time OC problems:

Given an initial condition x0, minimize the cost C(u, tu) among all inputs u(·) and all times tu such that xu(0) = x0 and xu(tu) = xf (by definition of xu).

The existence of minimal solutions u(·) with a finite time tu may be guaranteed under some technical conditions on the dynamics and on the cost (typically, compactness of the set of controls and convexity of f and l with respect to u, or f linear and l quadratic) (e.g., Lee and Markus, 1967). We will assume that such conditions hold here.

Let u(·) be a minimal solution of the problem. Then there exists a curve p(t), t ∈ [0, tu], in ℝn called the adjoint or costate vector, and a real number λ = 0 or 1, such that (x(·), p(·), u(·), λ) satisfies the conditions of the Pontryagin Maximum Principle (PMP) (e.g., Pontryagin et al., 1964; Lee and Markus, 1967; Todorov, 2006). In particular, defining the Hamiltonian of the problem is as follows:

graphic file with name zns00416-8031-m02.jpg

The adjoint vector satisfies the Hamiltonian equation ṗ = −x(t,x,p,u,λ); moreover, the free time setting gives the following additional equation (called transversality condition):

graphic file with name zns00416-8031-m03.jpg

The case λ = 0 occurs only for singular controls of the dynamics f(x, u) (these controls are such that the linearization of the system around the associated trajectory xu is not controllable). We will assume that no such control exist for f(x, u) (this is always true for controllable linear systems for instance), and so we can choose λ = 1 in ℋ. As a consequence, we obtain the following:

graphic file with name zns00416-8031-m04.jpg

On the other hand, it is obvious that u(·) is also a minimal solution of the following OC problem in fixed time tu as follows:

Minimize the cost

graphic file with name zns00416-8031-m05.jpg

among all inputs v(·) such that xv(0) = x0 (and xv(tu) = xf). The Hamiltonian associated with this problem is as follows:

graphic file with name zns00416-8031-m06.jpg

0 defines the same Hamiltonian equation than ℋ since 0x=x, so p(·) is also an adjoint vector for this OC problem in fixed time, and it is the only one since the adjoint vector associated with λ = 1 is unique.

Thus, for a given time tu and a given cost l, we can compute the value g(tu) in the following way. First, solve the fixed-time OC problem in time tu. The result is a triple (x(·), p(·), u(·)) from which the Hamiltonian can be calculated. Second, set

graphic file with name zns00416-8031-m07.jpg

to get the value of the infinitesimal CoT at time tu. A flowchart illustrating the process is given in Figure 2.

Figure 2.

Figure 2.

Flowchart of the methodology. First, a model of the plant dynamics f and a trajectory cost l replicating the experimental trajectories with known MT must be available (gray boxes). From these experimental trajectories, the initial and final states of the system can be estimated (x0 and xf), as well as the movement duration t (black box). Finally, the corresponding optimal control (OC) problem in fixed-time t must be resolved (central ellipse) to evaluate the (constant) Hamiltonian ℋ0 along the optimal trajectory; this precisely yields the value of the infinitesimal CoT at time t. The partial time derivative of the value function could be used if preferred.

A more elegant way to obtain the same conclusion is to proceed as follows. Let V(t, x0) be the value function of the OC problem in fixed-time t, that is:

graphic file with name zns00416-8031-m08.jpg

where the infimum is taken among all inputs u(·) such that xu(0) = x0 (and xu(t) = xf). It is the optimal cost of a motion in time t between x0 and xf. Then the time tu of an optimal solution u(·) of the free time problem satisfies the following:

graphic file with name zns00416-8031-m09.jpg

and so necessarily g(tu)+Vt(tu,x0)=0 (we assume here that V is differentiable with respect to t). It is well known from the Hamilton-Jacobi-Bellman theory that Vt (tu, x0) =0 (x(tu), p(tu), 1), where 0(x, p, 1) = maxv0(x, p, v, 1). Since for the optimal control u we have ℋ0(x(tu), p(tu), 1) =0(x(tu), p(tu), u(tu), 1), we recover in this way Equation 7. We did not use the standard way to define the value function: for a MT equal to t, this is usually V˜ (w, x(w)) = infwtl(xu(s),u(s))ds. Here we set V(tw, x(w)) = V˜ (w, x(w)), hence ∂V/∂t = −∂ V˜ /∂t.

Remarkably, the latter analysis shows that the derivation extends to stochastic settings (Stengel, 1986; Todorov, 2006). In that case, the infinitesimal CoT is simply the partial time derivative of the expected value function of the stochastic OC problem in fixed time tu. More precisely, for the stochastic dynamics tu. More precisely, for the stochastic dynamics dx = f(x, u)dt + g(x, u)dξ where dξ is a standard Wiener process, g(tu) results from the stochastic Hamilton-Jacobi-Bellman equation as follows:

graphic file with name zns00416-8031-m10.jpg

In the LQ Gaussian (LQG) case, the infinitesimal CoT can be easily computed because the value function has a parametric form whose parameters can be evaluated via the resolution of decoupled ordinary differential equations (e.g., Kappen, 2011).

In summary, the above theoretical considerations show that g(t) can be calculated by solving a deterministic or stochastic OC problem in fixed time t for some initial/final states of the system (x0 and xf). Movement duration and initial/final states are basic information that can be retrieved from experimental data. Thus, given a trajectory cost l and a model of the system dynamics, it is straightforward to sample the values of the mapping tg(t) on some time interval [tmin, tmax]. The boundary values correspond to the range where real observations can be taken in practice and come out experimentally. This is the empirical range of movement durations. As a consequence, the integral CoT (i.e., G) can only be identified up to a constant since the values of g(t) for t < tmin are unknown/unobservable. This constant shift, however, does not affect either the solution of the free time OC problem or the predicted amplitude–duration relationship over the range [tmin, tmax] (that will be exactly recovered on this interval). To infer the CoT outside of the range of actual measurements, we may devise at least two ways. First, the identified CoT may just be extrapolated for t < tmin and t > tmax. Alternatively, values of g(t) may be computed for unobserved times by relying on predictions of the empirical amplitude–duration relationship (e.g., obtained via regression analysis). Values inferred using the first and second methods will be displayed as dotted lines and solid lines, respectively, in all graphs depicting the CoT. Values inferred over the range of actual reach durations will be emphasized by filled circles added on the top of solid lines, which locate where the CoT was the most reliably identified.

Detailed solution for single-input LQ problems.

For deterministic single-input linear-quadratic problems, a more detailed account of the methodology can be given and closed-form solutions of the CoT can be obtained. This case is relevant as it includes basic eye and arm movements. The state of such systems can be described by x = (θ, …, θ(n−1)) ∈ ℝn, and then the dynamics has the form as follows:

graphic file with name zns00416-8031-m11.jpg

which is a single-input linear system = Ax + Bu, u ∈ ℝ. Typically, n = 2 or n = 3 for dynamic models of the arm or the eye (see below). Let us assume that a quadratic cost l(x, u) = (ukTx)2 has been determined (e.g., an “effort” cost). We choose the final point xf as an equilibrium state of the system. Up to a translation in θ, we can always assume xf = 0. We then choose a family of initial conditions x0(a) = (a, 0, …, 0), which are equilibrium states parameterized by the movement extent a > 0 (i.e., the amplitude of the motion). For every amplitude a > 0, we denote by t*(a) the duration of the motion between x0(a) and xf (which can be estimated experimentally), and by ua(·) a control minimizing the integral cost Ct*(a)(u) = 0t*(a)(ukTxu(t))2dt in fixed time t*(a) between xu(0) = x0(a) and xu(t*(a)) = xf = 0. Applying the above methodology, we have to compute ℋ0(x(t*(a)), p(t*(a)), u(t*(a)), 1) where:

graphic file with name zns00416-8031-m12.jpg

From the PMP, ua(t*(a)) minimizes ℋ0(x(t*(a)), p(t*(a)), u, 1) with respect to u, which means ∂ℋ0/∂u = 0, and replacing the solution in Equation 12, we obtain the following:

graphic file with name zns00416-8031-m13.jpg

Moreover, the value ua(t*(a)) can be seen to depend linearly on xu(0) in the LQ case, and so it depends linearly on a since xu(0) = x0(a) = ax0(1), In other words, ua(t*(a)) = aϕ(t*(a)), where the function ϕ( · ) is defined as follows: for every τ > 0, ϕ(τ) is the value u1(τ) of the control minimizing the integral cost Cτ(u) = 0τ(ukTxu(t))2dt in fixed time τ between xu(0) = x0(1) and xu(τ) = 0. The function ϕ(·) is a universal value of time that depends only on the system dynamics (A, B) and the trajectory cost (k) and not on the specific behavior of an individual. This universal value of time can be computed explicitly thanks to the equations given by Ferrante et al. (2005). Applying formula given in Equation 7, we finally obtain g(t*(a)) = ϕ(t*(a))2 a2.

Empirical observations show that the time t*(a) is typically an increasing function of the amplitude, so that its inverse a*(t) exists. We can then determine the function g(·) by g(t) = ϕ(t)2a*(t)2. In particular, if it appears from experiments that the function t* is approximately affine of the form t*(a) = αa + β, then the infinitesimal CoT can be written as g(t) = ϕ(t)2((1/α) t − β/α)2. Hence, it suffices to compute the universal value of time ϕ(t), which can be done explicitly, to recover the actual infinitesimal CoT from the experimental duration/amplitude mapping.

Interestingly, this analysis also reveals the effect of globally rescaling g by multiplying it by some positive parameter κ. This would induce a new amplitude κa*(t). Therefore, the rescaled CoT would lead to the affine amplitude–duration relationship t*(a) = αa/κ + β. It can be concluded that simply rescaling the CoT does not allow to change both the slope and the intercept of the amplitude–duration relationship. Hence, if both the intercept and slope are found to change experimentally, this variation cannot be attributed to a global rescaling of the time cost. The latter point is important if attempting to account for speed variations and interindividual differences (see Results).

Uniqueness and robustness properties for single-input LQ problems.

From the above considerations, it is clear that the values of the infinitesimal CoT may critically depend on the trajectory cost. This raises the question about the robustness and uniqueness of g in general as the cost l may be known only approximately or may even be quite different from the true cost without it being noticeable in the motion data under investigation. Although important, this question was not addressed in previous studies. The problem can be exemplified for planar point-to-point reaches where the torque change (Uno et al., 1989) and jerk (Flash and Hogan, 1985) trajectory costs are hardly distinguishable despite their different nature (dynamic vs kinematic costs). For the CoT theory, different trajectory costs may conceivably lead to divergent time costs. Thus, before identifying the CoT, the question of what trajectory cost underlies the system trajectories with known MT must be posed. It may be argued that the trajectory cost must be identified through the resolution of the following inverse OC problem in fixed time:

Given a set of recorded trajectories in time τ, find l such that every recorded trajectory y(t) is a minimum of the integral cost as follows:

graphic file with name zns00416-8031-m14.jpg

among all trajectories xu such that xu(0) = y(0) and xu(τ) = y(τ).

However, this inverse problem is ill posed in general, which means that its solutions l(x, u) are neither unique nor stable with respect to perturbations of the recorded data (which are affected by sensorimotor and measurement noise and subject to interindividual variability). Within the single-input LQ framework, however, powerful results of uniqueness and robustness can be obtained, as explained hereafter.

We thus assume that the class of admissible trajectory costs is the class ℒ of quadratic costs ℒ = {l(x, u) = u2 + xTQx + 2xTSu}, with standard hypothesis on the matrices Q and S (i.e., Q = QT ≥ 0 and S is such that l is a positive semidefinite quadratic form). Given a cost l ∈ ℒ, the classical fixed-time LQ theory asserts that the solution of the corresponding OC problem in time τ satisfies u(t) = Kτ(t)x(t), where the time-dependent matrices Kτ(t) are determined through some Riccati equations. Alternatively, Ferrante et al. (2005) showed that all solutions in fixed time are determined by only two (1 × n) matrices K, K+ (i.e., the set {Kτ(·), τ > 0} is uniquely characterized by a pair, K, K+). Following the latter approach, let us introduce the set 𝒦 of all such pairs, that is, (K, K+) belongs to 𝒦 if there exists a LQ problem associated with a cost l ∈ ℒ whose solutions are characterized by (K, K+). The following theorem is proved by Nori and Frezza (2004).

Theorem 1.

For any pair (K, K+) ∈ 𝒦, there exists a unique vector k ∈ ℝn such that (K, K+) describes the optimal solutions of the fixed-time OC problems associated with the following:

graphic file with name zns00416-8031-m15.jpg

Moreover, the map (K, K+) ↦ k is continuous (actually k = −K+T).

As a consequence, for single-input LQ problems, there exists a complete methodology for determining unequivocally and robustly the cost of the time g. The method is summarized as follows: (1) determine a pair (K, K+) ∈ 𝒦 such that the associated optimal trajectories match accurately the recorded ones in fixed time (this can be written as a least-square problem); (2) set k = −K+T and l(x, u) = (ukTx)2; (3) compute the function ∂V/∂t (t, x) by using the Hamiltonian ℋ0 associated with l(x, u); (4) for every time t, identify from experimental data an initial condition x*(t) corresponding to a movement in time t (if it exists for that t); and (5) set g(t) = −∂V/∂t (t, x*(t)). The latter analysis proves that the CoT can be unequivocally and robustly identified from experimental data for single-input LQ problems. Although restricted, this class of problems covers most of the applications considered in the present study, which reinforces the significance of our findings. For more general problems, either the literature already provides some widely accepted trajectory cost or numerical techniques of inverse optimal control can be used to identify the cost function best replicating the system trajectories in fixed time (e.g., Berret et al., 2011).

Computational procedures

Models for saccades.

The eye plant model was taken from Shadmehr et al. (2010). Briefly, the eye dynamics was single-input linear system as follows:

graphic file with name zns00416-8031-m16.jpg

where c1 = 1.165 × 10−5, c2 = 3.9 × 10−3, c3 = 0.241, and c4 = 1 for a human eye. The system state was x = (θ, θ̇, θ̈), and an effort cost l(x, u) = u2 was minimized. This problem falls within the single-input LQ framework described above.

We also performed simulations with a signal-dependent noise following Shadmehr et al. (2010) and identified the CoT in this stochastic context. Signal-dependent noise is a random variable with a normal distribution of mean zero and variance equal to kSDN u2. This problem then falls within the LQG framework, for which closed-form solutions are available to compute the value function and in particular its partial time derivative (for the equations, see Kappen, 2011). The default noise level (signal-dependent noise) was set to kSDN = 0.0075, and a terminal cost accounting for accuracy of the form xDx with D = diag(103, 1, 10−2) was added to the integral cost.

Models for reaching: Single degree-of-freedom (dof) limb.

For a 1-dof arm moving in the horizontal plane, the basic model used throughout the study was already described in numerous other studies (e.g., Hogan, 1984; Tanaka et al., 2006; Gentili et al., 2007; Gaveau et al., 2014) and is as follows:

graphic file with name zns00416-8031-m17.jpg

where is θ the shoulder joint angle, τ is the muscle torque, b is the friction coefficient (b = 0.87 here), I is the moment of inertia of the arm with respect to the shoulder joint (value estimated based upon Winter's table for each participant) (Winter, 1990) and u is the single control variable.

For the trajectory cost, we typically considered canonical quadratic costs of the form l(x, u) = (ukx)2, where x = (θ, θ̇, θ̈) denotes the system state. The two most famous examples are the minimum torque change corresponding to l(x, u) = u2 (i.e., k = 0) (Uno et al., 1989) and the minimum jerk corresponding to l(x, u) = θ2 (i.e., k = (0, 0, b)) (Flash and Hogan, 1985). However, other costs, possibly composite, may account for such planar movements. Therefore, additional perturbations of the effort cost were tested as follows: a torque-based cost (l(x, u) = 0.05u2 + τ2) (Nelson, 1983), an energy-based cost (absolute work, l(x, u) = 0.05u2 + | τθ̇ |) (Berret et al., 2008), and an acceleration-based cost (l(x, u) = 0.05u2 + θ̈2) (Ben-Itzhak and Karniel, 2008). All these costs were checked to predict symmetrical and smooth velocity profiles and thus to be plausible trajectory costs for such a type of movement under consideration. For visualization purposes, we considered when convenient l(x, u) = 0.05u2 and l(x, u) = 0.05θ2 for the torque change and jerk costs, respectively.

Sensorimotor noise is known to play a role on motor control during reaching, especially signal-dependent noise (Todorov and Jordan, 2002; van Beers et al., 2004). Therefore, we also tested signal-dependent noise among values in the following range kSDN = [0; 0.05; 0.075; 0.1; 0.125]. In this stochastic context, a terminal cost accounting for accuracy of the form xDx with D = 5diag(105, 103, 102) was added to the integral cost.

1-dof with muscle dynamics.

For the sake of completeness, we also tested a more advanced model of the arm, namely a rigid body actuated by two antagonist muscles modeled as second-order low-pass filters (Van der Helm and Rozendaal, 2000) with certain time constants. In that case, the single-joint arm dynamics could be written as follows:

graphic file with name zns00416-8031-m18.jpg

where the subscripts 1 and 2 denote the flexor and extensor muscles, respectively, ε is the muscle excitation, ui is the control variable, which can be thought of as motor neuron input, and σ is the time constant of the filter. Typically, the value is chosen ∼0.04 s in the literature for fast movements (e.g., Liu and Todorov, 2007; Guigon et al., 2007), but we tested values ranging from 0.03 s to 0.08 s to evaluate the effects of varying this parameter on the shape of the CoT. In the reported simulations, we minimized the following effort cost: l(x, u) = 0.005(u12 + u22).

2-dof arm.

For a nonlinear multijoint planar arm model, a full description and the parameters used for the present simulations can be found in previously published articles (e.g., Berret et al., 2008, 2011). Briefly,

graphic file with name zns00416-8031-m19.jpg

where θ = (θ1, θ2) and τ = (τ1, τ2) denote the joint angle and torque vectors, respectively. The quantities ℳ, 𝒦, and ℱ are the inertia, the Coriolis/centripetal, and the viscosity matrices, respectively.

Smooth torque signals were obtained by controlling the torque derivative to emulate the low-pass filter characteristics of muscles, which is a classical assumption done in other studies (e.g., Uno et al., 1989; Nakano et al., 1999) and is as follows:

graphic file with name zns00416-8031-m20.jpg

where u is the control variable.

Anthropometric parameters were taken from Winter's tables (Winter, 1990), and values were adjusted to each participant.

Numerical optimal control.

To solve a free time optimal control problem, a naive strategy might consist of testing different MTs until the best duration is found, by solving fixed-time problems at each iteration. This approach would be of course computationally intensive and rather inefficient. However, it can be seen from the PMP that the free time setting just introduces an additional equation (see Eq. 3). Indeed, the Hamiltonian has to be zero along the optimal state/costate trajectory, which can be exploited to find the solution without resorting to any trial-error procedure and without additional computational complexity. For resolving general optimal control problems (fixed- or free time), we relied here a classical method that consists of transforming the OC problem into a nonlinear programming problem with constraints. More specifically, we used an orthogonal collocation method that is efficiently implemented in MATLAB (The MathWorks), called GPOPS (Benson et al., 2006; Garg et al., 2010; Rao et al., 2010). The nonlinear programming problem was solved by means of the well-established numerical software SNOPT (Gill et al., 2005). We checked a posteriori that the Hamiltonian was either zero (in free time) or constant (in fixed-time) along the resulting optimal state/costate trajectories. For LQ or LQG problems, the simulations were also performed by using the closed-form solutions of the problem, and we checked that both numerical methods yielded the same trajectories and time costs.

Experimental procedures

Main experimental task participants.

Twenty right-handed healthy adults, without neuromuscular diseases and with normal or corrected-to-normal vision, participated in this study (10 males, age: 30 ± 5 years, mass: 67 ± 12 kg). Written informed consent was obtained from each participant in the study as required by the Helsinki Declaration and the EA 4042 local Ethics Committee. All the participants were naive to the purpose of the experiment.

Motor task.

Participants performed visually guided single-joint movements (rotation around the shoulder joint) in the horizontal plane. Participants stood in front of a large vertical screen where spotlight targets (3 cm diameter) were displayed. We tested 10 amplitudes ranging from 5° to 95°, and 10 repetitions were recorded for each amplitude (5 in the rightward and 5 in the leftward direction). Participants started with the arm orthogonal to the screen. A sequence of 20 targets was then displayed (a new target appeared every 3.5 s). The starting arm position was varied across the experiment and coincided with the previous target location. For each participant, 5 blocks of 20 movements were recorded, for a total of 100 movements per participant. The sequence of trials was fully randomized within each block, however, ensuring that one leftward and one rightward movement of each amplitude was presented within each sequence of 20 trials.

Instructions.

Participants were instructed to move at a spontaneous, comfortable speed. They were asked to point toward the displayed spotlight target without trying to touch it because it would otherwise induce trunk movements. They were required to keep the arm fully extended, to minimize trunk rotations while performing the task and to perform one-shot movements without correction. They also had to support the weight of their arm by themselves and try to move in a transverse plane as much as possible. A familiarization phase consisting of 20 trials was performed before the recording of the 100 trials. Participants were allowed to rest and relax their arm every 20 trials for several minutes (a “pause” word appeared on the screen for that purpose).

Data collection and processing materials.

Arm and head motion of the dominant arm were recorded by means of a motion capture system (Optitrack device). Ten cameras were used to capture the movement of five retro reflective markers (14 mm in diameter), placed at well-defined anatomical locations on the moving arm and head (acromial process, humeral lateral condyle, apex of the index finger, left and right sides of the frontal bone).

Data analysis.

All the analyses were performed with custom software written in MATLAB from the recorded 3D positions of the markers (sampling frequency of 250 Hz). Recorded signals were low-pass filtered using a digital fifth-order Butterworth filter at a cutoff frequency of 10 Hz (MATLAB filtfilt function). Velocity profiles were computed via numerical differentiation.

The movement onset time was defined as the instant at which the linear tangential velocity of the fingertip exceeded 5% of its peak and the end of movement as the point at which the same velocity dropped below the 5% threshold. Movement duration was first inferred from those values as it is a traditional method. However, given the purpose of the paper, a second method was used where the velocity profiles were fitted to minimum jerk velocity profiles (Flash and Hogan, 1985) to infer the movement duration (in the spirit of Botzer and Karniel, 2009). To do so, a global optimization using MATLAB's patternsearch function was performed, and the parameters of the best-fitting minimum jerk problem were identified. On average, both methods gave similar results, but the second method revealed itself to be more robust to variations of motion amplitude and noise, and was therefore much more reliable in general. This second method was thus retained. Finally, the angular position of the arm was also evaluated from the shoulder and finger markers and using some trigonometry. The error between the theoretical amplitude and the one inferred from our recorded data was 2.0 ± 0.9 degrees on average across participants. This indicated that the participants fulfilled the task correctly by respecting the required amplitudes and that the stimuli displayed on the screen were correctly calibrated.

Additional experiment.

Four right-handed healthy adults were tested during an additional experiment addressing the case of small amplitudes/times (age: 32 ± 2 years, mass: 66 ± 4 kg). The same protocol as the main experiment was used, except that a smaller target width (0.3 cm instead of 3 cm) and a smaller fingertip marker (9 mm instead of 14 mm) were used because specific focus was put on small motion extents. A first block of 100 movements was recorded for amplitudes ranging from 5° to 95° as for the main task. A second block of 100 movements was recorded for amplitudes ranging from 1° to 5°. The order of the two blocks was counterbalanced across subjects. The smallest tested movements were ∼12 mm here, which was still much above the accuracy of the motion capture system (<0.5 mm from calibration). A total of 200 reaches per participant was thus available in this dataset, ranging from very small to very large motion extents.

Results

Illustration for saccades

We first considered the case of saccades to test the effectiveness and relevance of our methodology. In a series of papers, Shadmehr and colleagues (Shadmehr et al., 2010; Shadmehr, 2010) already investigated in great detail the extent to which the CoT theory could account for the duration of horizontal saccades and their variations with respect to the reward assigned by the brain to the task. They concluded that a hyperbolic CoT replicated better the amplitude–duration relationship than linear or quadratic alternatives. However, “the timescales [were] too short to allow [them] to dissociate between hyperbolic and exponential temporal discount functions” (Shadmehr, 2010). Considering intertrial intervals, they nevertheless concluded later that an exponential CoT was inconsistent with the data (Haith et al., 2012). Here we relied upon the same experimental data reported by Collewijn et al. (1988), and we used the same linear model of the oculomotor plant and trajectory cost (i.e., “effort” cost, quadratic in the control variable) to identify the CoT underlying human saccades (as described also by Harris and Wolpert, 1998; Tanaka et al., 2006; Kardamakis and Moschovakis, 2009; Shadmehr et al., 2010).

In contrast with Shadmehr et al. (2010), the present investigation did not require guessing a parametric form of the way the brain values the passage of time. Instead, we directly computed the infinitesimal CoT at different times based upon either the affine amplitude–duration relationship or the true data points exhibiting a growth larger than linear for ample saccades reported by Collewijn et al. (1988). We identified two similar yet slightly different time costs (Fig. 3a,b, black and gray traces). First, our results confirmed previous findings by showing that the CoT underlying saccades had undeniably a concave shape, thereby ruling out other time costs, such as linear and convex ones. When fitting the infinitesimal CoT on the range of actual durations (i.e., [36.5–295 ms]), we found that the root mean square error (RMSE) for the exponential fit was only slightly better than the hyperbolic one (0.010 vs 0.015). Fitting g on a larger interval would be possible, but the result would then depend heavily on the extrapolation method. Second, the reward assigned to the task could be estimated from the asymptotic value of G, without the need for a parametric guess of the CoT. However, when a parametric model of CoT was available (not based on blind intuition though but on the shape of g), the reward could be estimated via the parameter α (Fig. 3c), without resorting to any trial/error adjustments. In that case, the reward discount rate could be evaluated as well via the parameter β (Fig. 3c). Third, when modeling signal-dependent noise and thus switching to the stochastic context as in Shadmehr et al. (2010) or Harris and Wolpert (1998), the identified values of the infinitesimal CoT were found to be larger than for the deterministic case (Fig. 3d). This increase of the CoT was mainly a consequence of the larger expected trajectory cost due to the effects of sensorimotor noise. As the level of noise increased, the CoT approached a hyperbolic rather than an exponential discounting of reward (Fig. 3d, RMSE). Because our methodology provides an exact solution to the identification problem, the original amplitude–duration relationship was perfectly recovered from free time OC simulations as expected (using the inferred CoT, see Fig. 3e). The associated optimal saccade velocity profiles are reported in Figure 3f. The important result was that in all cases the instantaneous CoT was a decreasing function of time such that G (the integrated CoT) always had a clear concave shape. Importantly, our theoretical results for single-input LQ problems ensure that this inferred CoT is not just an artifact but does possess such well-identified characteristics. Our findings thus rule out nonconcave time costs for the control of saccades and confirm the hyperbolic shape of the CoT (Shadmehr et al., 2010; Haith et al., 2012).

Figure 3.

Figure 3.

Cost of time underlying saccades, as inferred from data reported by Collewijn et al. (1988). a, Infinitesimal CoT g(t). To infer this cost, we used either the linear regression or the true data points exhibiting a growth larger than linear for amplitudes >40 degrees (see Collewijn et al., 1988). Dotted traces represent extrapolated parts of the CoT. Solid lines indicate values inferred using our inverse methodology. Our approach does not require fitting parameters or hypothesizing the shape of the time cost. b, Integral CoT G(t). The concavity of the CoT is obvious; and in particular, linear/convex costs can be ruled out (Shadmehr et al., 2010). c, Fitting of the infinitesimal CoT. We fitted g on the actual range of durations of observed saccades (i.e., from 36.5 to 295 ms). We considered both exponential and hyperbolic candidate functions. Best fitting parameters were as follows: (α, β) = (1.3, 4.3) and (α, β) = (0.9, 5.5) for the hyperbolic and exponential fits, respectively. d, Simulations in the stochastic case with different levels of signal-dependent noise (from kSDN to nkSDN with n = 2, 3, and 4, denoted by × n in the figure). RMSE is reported for hyperbolic (hyp.) and exponential (exp.) functions, respectively. With larger multiplicative noise, the CoT gets closer to the hyperbolic class of CoT. e, Verification of the amplitude–duration relationship as obtained from free time optimal control simulations with the CoT depicted in b (black trace). White-filled circles represent true data points (i.e., target values for the model). Black crosses represent simulated data (i.e., reconstructed values from the model). Both series of points matched perfectly because the methodology provides an exact solution to the problem of identifying the CoT associated with the observed amplitude–duration relationship. f, Velocity profiles of saccades of different amplitudes as predicted by the optimal control model.

Application to reaching

We then investigated in depth the CoT underlying the control of reaching. As far as we know, inferring the CoT for reaching has not been attempted yet. To start, we considered the case of a single-joint arm moving in the horizontal plane whose linear musculoskeletal dynamics was described in several studies (Hogan, 1984; Tanaka et al., 2006; Berret et al., 2008; Gaveau et al., 2014) and detailed in Materials and Methods. We relied on the effort required to generate a reach as trajectory cost (i.e., a quadratic cost in the control variable). Incidentally, it coincided here with the torque change (Uno et al., 1989), a prominent model known to replicate well the bell shape of velocity profiles during single-joint rotations (Gottlieb et al., 1989). This problem also falls within the single-input LQ class for which we have powerful mathematical results.

To uncover the CoT underlying such self-paced arm pointing movements, experimental data were first gathered and parameters such as movement extent and duration were computed. For all the participants, the relationship between amplitude and duration was approximately linear, in agreement with the existing literature (Brown et al., 1990; Gentili et al., 2007). Figure 4 illustrates this relationship for 4 participants separately, which was quantified via linear regressions based on all the 100 trials recorded and depicted as single gray dots (R2 > 0.75 for all the 20 participants). Even though the task was fairly simple, interindividual differences were noticeable. In particular, both the slope and the intercept of the regression line varied across participants.

Figure 4.

Figure 4.

Experimental amplitude–duration relationships for four different subjects (P1–P4) during arm reaching. Each dot indicates a single movement (trial) of amplitude lying between 5° and 95°. The result of a linear regression applied to the data is reported, namely, an affine fit of the form t = α a + β, with t being the MT in seconds and a the motion amplitude in degrees. Overall, even though intertrial variability was noticeable, a clear and significant (p < 0.001) affine trend was observed in all cases, as revealed by the relatively large R2 values.

For each single movement, information about its amplitude and duration was eventually used to estimate the value of g for that time. Figure 5a displays the values of g(t) when sampled on such a single-trial basis (gray dots). Each dot in this figure corresponds to a dot in Figure 4. Because of sensorimotor noise and intertrial fluctuations (clearly visible in Fig. 4), the identified CoT was itself quite noisy when inferred from single trials. To get a more reliable and noiseless estimation of g, we then exploited the affine law linking amplitude and MT to infer the CoT on the time interval where actual measurements were taken in practice (Fig. 5a, black traces). The time interval containing the empirical reach durations is emphasized with filled circles added on the CoT traces. Its boundary values, corresponding, respectively, to the faster and slower motions observed during the experimental session, varied between participants as they were instructed to move at their own pace. Outside of this interval, the shape of the infinitesimal CoT was less reliable as it was either computed based upon regression equations (yielding predicted MTs for untested amplitudes) or just extrapolated by assuming that the initial and asymptotic values of g were both zero (Fig. 5a, solid lines and dotted lines, respectively). In Figure 5b, the corresponding integral CoTs, G, are depicted. The CoT exhibited a striking logistic shape for all the 20 participants, regardless of their spontaneous pace and anthropometric characteristics. Importantly, this conclusion could be drawn from the most reliable part of the curve, that is, over the range of MTs that were recorded in practice (and without resorting to any sort of extrapolation). This nontrivial shape would have been hardly predictable using a parametric approach as the CoT is neither concave nor convex but sigmoidal (at least on the time interval of interest, i.e., from ∼500–1500 ms). As a byproduct, these results show that for reaching the CoT is not purely hyperbolic or exponential.

Figure 5.

Figure 5.

Cost of time underlying single-joint arm reaching, as inferred from data reported in Figure 4. a, Infinitesimal CoT, g(t), for each of the four participants (P1–P4). Solid black traces represent values of g(t) inferred from linear regressions. Experimentally, MTs belonged to a certain interval [tmin, tmax] (emphasized by black filled circles), but it was possible to identify g(t) on a larger time interval by using predictions from the regression equation (black solid line). Dotted lines indicate extrapolated values of g(t) (i.e., values of g(t) not computed via our methodology). Light gray dots indicate values of g(t) inferred from single trials: each gray dot corresponds to a dot displayed in Figure 4. b, Integrated CoT, G(t). These curves were obtained by trapezoidal integration of the infinitesimal CoTs for each participant separately. The cost G(t) is only identified up to a constant, but in all cases the CoT exhibited a sigmoid-like shape on the range of actual motion durations (∼500–1500 ms on the present data).

Verification through direct optimal control simulations in free time

Before going further, we performed free time optimal control simulations by using the identified g to verify numerically the effectiveness of our methodology. As expected, the fitted relationship between amplitude and duration was exactly the same in simulation and experiment (Fig. 6a). This was expected given that the method seeks the CoT that will exactly account for the original amplitude–duration relationship. The velocity profiles were also bell-shaped and simply scaled with speed in accordance with well-known experimental observations (Fig. 6b) (e.g., Gottlieb et al., 1989). It is worth noting that solving free time optimal control problems is not computationally more intensive than solving their fixed-time counterparts. In particular, resolving a free time problem does not necessarily require a trial/error procedure as it is often assumed or done in the literature (e.g., van Beers, 2008; Shadmehr et al., 2010; Rigoux and Guigon, 2012; but see Hoff, 1994).

Figure 6.

Figure 6.

Verification of the methodology via free time OC simulations. a, Amplitude–duration relationship recovered for participant P1 using the CoT identified in Figure 5b (black trace). Because the methodology is exact, the same regression equation as in Figure 4 is obtained. MT was left free and emerged automatically for each amplitude in these simulations. b, Associated movement kinematics. Velocity profiles remained bell-shaped in accordance with experimental observations and scaled with MT.

Consistency with respect to the fitting of the amplitude–duration relationship

In the preceding paragraphs, we assumed an affine relationship between amplitude and duration with the consequence of having non-zero MT at zero amplitude. Actually, other functions, such as concave ones, may fit the data equally well, with or without bias at the origin. To investigate the time cost consistency with respect to data fitting, we collected additional data involving reaching movements of amplitudes <5° because nonlinearity could mainly arise for small times/amplitudes (see additional experiment in Materials and Methods). While the shape of the amplitude–duration relationship was very close to linear for reaches >5°, these data allowed us to refine our analysis near the origin. Figure 7a shows the amplitude–duration relationship for amplitudes ranging from 1° to 5° (100 reaches) and 5° to 95° (100 reaches) for horizontal reaches with a fully extended arm. The data revealed that the relationship was more concave than linear in the vicinity of small amplitudes, even though an affine fit still performed relatively well overall (R2 > 0.85). Although it was not possible to decide between concave fits with or without intercept from the R2 values (light vs dark gray traces), the CoT appeared to be sigmoidal over the range of empirical MTs in any case (i.e., ∼300–1500 ms; see Fig. 7b). For all the participants, a similar bell shape for the function g starting from times ∼300 ms was found. The only noticeable difference between fits with or without intercept occurred for small times (corresponding to very short motion extents because the fitted curve was very steep near the origin) where g was found to increase as MT tended to zero in the latter case (dark gray traces). This finding is, however, hypothetical because it is not grounded on real data as we did not measure reaches with duration <300 ms in this experiment. Nevertheless, the integral CoT, G, exhibited a clear sigmoidal shape over the range of actual reach durations. The latter was fully characterized up to a constant shift corresponding to the (unknown) values of g(t) for t <300 ms. These values, however, do not affect the ability of the identified G to predict the correct amplitude–duration relationship for times of practical interest for reaching.

Figure 7.

Figure 7.

CoT consistency with respect to data fitting. a, Amplitude–duration relationship of two participants (left and right panels, respectively) who were asked to perform fully extended arm movements in the horizontal plane, for amplitudes ranging from 1° to 95°. In addition to linear functions, concave ones were tested for regression analysis (gray traces). The first log-based fitting had a zero-intercept constraint and was of the form t = αlog2a + 1) + γa (dark gray). The second log-based fitting (light gray) was chosen in the spirit of Fitts (1954) and the same as Young et al. (2009): that is, of the form t = αlog2(a/w + 1) + βa + γ where w denotes the actual width of the target (0.3 cm, i.e., ∼0.2 deg here). b, On the range of empirical times (∼300–1500 ms in these data), the CoT systematically exhibited a sigmoidal shape (filled circles on the traces). For smaller and larger times, we had to rely on extrapolations because data were no longer available (solid and dotted lines). Gray dots indicate values of g(t) identified on a single-trial basis and yield more noisy estimates.

Consistency with respect to trajectory costs, motor noise, and muscle dynamics

Before concluding about the sigmoidal shape of the CoT on the time interval of empirical reach durations, we further tested its consistency with respect to changes of trajectory cost, motor noise, and muscle dynamics (Fig. 8). Thus far, we relied on a simple quadratic effort cost (i.e., the torque change here). However, because a variety of trajectory costs could replicate the bell-shaped velocity profiles of such single-joint arm rotations, we assessed the robustness of the CoT with respect to other plausible trajectory costs. We considered the minimum jerk (Flash and Hogan, 1985), torque (Nelson, 1983), absolute work (Berret et al., 2008), or acceleration (Ben-Itzhak and Karniel, 2008) models. The extracted time costs are reported in Figure 8a. This numerical analysis confirmed the consistency of the shape of g with respect to l. A candidate trajectory cost must at least replicate the arm trajectories when MT is known; therefore, this condition was ensured before extracting the CoT (i.e., the smooth bell shape of velocity profiles). We then tested the effect of motor noise as it is known play on role on arm movement control (Todorov and Jordan, 2002; van Beers et al., 2004). Results are reported in Figure 8b. Signal-dependent noise did not change the overall sigmoidal shape of the CoT but just tended to increase the CoT magnitude.

Figure 8.

Figure 8.

Cost of time consistency (for participant P1, with the affine fit of the amplitude–duration relationship of Fig. 4). a, CoT when assuming various trajectory costs. Black represents the CoT for the torque change model, here scaled by a 0.05 factor for visualization purpose (infinitesimal CoT on left; integrated CoT on right). Grayscale represents the CoT identified from various trajectory costs (jerk, acceleration, absolute work, and torque). Although the specific values of the CoT necessarily changed with respect to the trajectory cost, its sigmoidal shape was nevertheless very robust. b, CoT when considering signal-dependent noise (kSDN) and a stochastic OC formulation. Increasing multiplicative noise essentially induced an increase of the CoT, but, again, the shape of the CoT remained sigmoidal. c, CoT when modeling muscle dynamics as second-order low-pass filters with different time constants and optimizing a neural effort cost. When still optimizing the torque change, no effects of the muscle dynamics were observed (result not depicted). When optimizing effort, the exact shape of the CoT depended on the underlying muscle dynamics. However, the shape of the infinitesimal CoT still showed a stiff increase after some time until a peak and followed by a slower decrease toward zero. Hence, the systematic nonmonotonicity of g(t) proves that the CoT is neither concave nor convex but possesses a generalized logistic shape. Notably, this conclusion can be drawn from the truly identified part of the curves, which is depicted via solid lines with filled markers. Dotted lines still indicate extrapolated values of the CoT outside of the range of actual measurements (using exponential functions here).

In the preceding simulations, a simple model of the arm mechanics and muscle dynamics was used. We tested the extent to which the sigmoidal shape of the CoT was dependent on the model of the musculoskeletal system. To this aim, we considered that the arm was driven by two antagonist muscles modeled as second-order low-pass filters with specific time constants. When still minimizing the torque change (but controlling the input to muscles), we found that the CoT remained very precisely the same whatever the time constants of the muscles (we tested 0.03–0.15 s). However, when the cost function was modified such that a quadratic cost (i.e., a “neural effort”) was minimized instead of the torque change, the CoT depended on the muscle characteristics (Fig. 8c). Importantly, although the precise values of the CoT obviously varied as a function of the muscle time constants, its sigmoidal shape was nevertheless preserved. Its rate of change g was robustly characterized by a steep increase until a peak was reached and followed by a slower decay on the time interval where the CoT could be reliably sampled. These tests further show that the neural effort cost, yet another plausible candidate trajectory cost known to predict smooth bell-shaped velocity profiles (e.g., Liu and Todorov, 2007; Guigon et al., 2007), also yields a sigmoidal CoT [especially for σ ≈ 0.04 s where velocity profiles are close to minimum jerk ones, which are known to well replicate human velocity profiles; Richardson and Flash (2002)].

Consistency with respect to multijoint dynamics, different initial/final states, and speed instructions

Experimentally, different initial/final arm configurations may lead to the same MT. From the identification standpoint, this may yield several possible values for g(t) where the CoT theory would assume only one. For the above planar single-joint arm movements, this was, however, not a problem. At the theoretical level, the invariance of g with respect to initial/final states follows trivially from the fact that the optimal solutions only depend on the amplitude a, and not specifically on the states x0 and xf. This fact is consistent with experimental observations where MT was found to mostly depend on a and neither specifically on the relative starting hand position in the workspace nor on the leftward/rightward direction of the motion for such horizontal single-joint rotations (Gentili et al., 2007). In general, however, reaching involves a nonlinear plant and several degrees of freedom. The situation is thus more complex for a multiarticulated arm, but our methodology allows for sampling g(t) in the exact same way and effortlessly as soon as the associated fixed-time OC problem can be (numerically) resolved.

For a two-joint arm moving in the horizontal plane, the role of the state/control cost l is reinforced because the hand path depends on it. Attempting to predict MT would be completely irrelevant if the hand path cannot be accurately replicated when MT is known at least. Here we still considered the minimum torque change model for the trajectory cost because it is known to account well for both the quasi-straight path and bell-shaped velocity profile of the hand during planar point-to-point arm movements (Uno et al., 1989). We used the data of Young et al. (2009) and the reported equations for the amplitude–duration relationship to recover the CoT underlying such multijoint arm reaches. The task is illustrated in Figure 9a, and the amplitude–duration mappings at natural and quick speeds are given in Figure 9b. The CoT g was then sampled pointwise from these linear fits using our methodology (Fig. 9c,d), and it was extrapolated out of the range of real measurements. Because Young et al. (2009) did not report any leftward/rightward difference regarding MTs, we used the same affine relationship for leftward and rightward movements performed at natural speed. The infinitesimal CoT differed slightly depending on movement direction. Additionally, varying the starting posture of the arm also led to different instantaneous time costs for leftward hand displacements (Fig. 9c, left). These CoT variations are due to the arm's inertial anisotropy (Gordon et al., 1994) and the fact that the torque change is a dynamic cost. For the sake of completeness, we also identified the CoT for the hand jerk cost (Flash and Hogan, 1985), and this revealed that the CoT remained exactly the same regardless of movement direction and initial posture (data not shown).

Figure 9.

Figure 9.

Cost of time underlying multijoint reaching, as inferred from data reported previously (van Beers et al., 2004; Young et al., 2009). a, Illustration of the task and optimal trajectories as predicted by the model. b, Affine amplitude–duration relationships reported by Young et al. (2009) for natural and quick speeds. c, Infinitesimal CoT g(t) identified for natural speed for rightward (dark gray) and leftward movements starting from different initial positions (black and light gray). d, Infinitesimal CoT identified for quick speed for the central starting position (light gray). Black dots indicate pointwise evaluations of g(t) when considering the center-out reaching task (e) and the corresponding MTs reported by van Beers et al. (2004). e, Center-out reaching task with the hand mobility ellipse depicted. f, MTs predicted by the model when solving free time OC problems using the previously identified CoT (i.e., c, d, light gray traces).

The above variations of CoT were relatively small compared with what was observed during a change of speed instruction (Fig. 9c, right). When asking the participants to move quickly, one can observe that both the slope and the intercept of the affine relationship change drastically (Fig. 9b). This resulted in very different values for the function g. Not only the peak was multiplied by ∼30×, but MT was also penalized slightly earlier (onset time of g). This agrees with our theoretical analysis where we showed that globally rescaling the CoT cannot lead to a change in both slope and intercept of the amplitude–duration relationship.

Hitherto, we have identified the CoT from experimental data. We also assessed the extent to which the CoT identified in a given task could be used to predict MTs in another task. In Figure 9e, a center-out reaching task was thus considered as it has been the subject of extensive investigations (Gordon et al., 1994; van Beers et al., 2004). Using the CoT identified at natural speed for the central starting position and using the relationship between duration and extent for mediolateral movements, it was possible to predict the directional dependence of MT during a center-out task (Fig. 9f). The same could be observed at quick speed using the corresponding CoT. We eventually identified the values of g from the data of van Beers et al. (2004) at quick speed for the 16 movement directions and plotted the corresponding values in Figure 9d (black dots), where the CoT had been identified from the data of Young et al. (2009). The location of the dots around the CoT uncovered at quick speed confirmed the relative directional invariance of the infinitesimal CoT.

Above all, these findings reveal the robust sigmoidal shape of the CoT with respect to modifications of data fitting, trajectory cost, system dynamics, initial/final states, and task instructions, even though its precise values necessarily depend on all the details of modeling. These present results rule out purely concave or convex time costs for reaching, whereas its concavity (e.g., hyperbolic) was confirmed for saccades.

Discussion

We addressed the issue of how the brain selects the duration of movement. We relied upon the “cost of time” theory that finds its roots in the brain's tendency to discount the actual value of future reward (Shadmehr, 2010; Shadmehr et al., 2010; Shadmehr and Mussa-Ivaldi, 2012). It asserts that the time elapsed until action completion entails a cost, thereby making slow moves nonoptimal. By means of a thorough mathematical analysis, we developed a complete, robust, and fairly general methodology to automatically identify the CoT from basic experimental data by sampling its infinitesimal values without making prior assumption about its hypothetical nature. An application to reaching revealed that an overall sigmoidal CoT underlies the planning of arm movement. In the following, we discuss the possible origin, rationale, and implication of such a CoT.

The CoT is a generic but appealing concept that has been recently popularized by Shadmehr's group. In particular, it is straightforward to interpret the CoT as a temporal discounting of reward by obvious mathematical equivalence (Shadmehr, 2010), thereby creating a link between the reward system and motor control (Wolpert and Landy, 2012). In most studies investigating this link, however, humans or animals were motivated via incentives, and gratification was often made explicit to them (Battaglia and Schrater, 2007; Dean et al., 2007; Niv et al., 2007; Hudson et al., 2008; Guitart-Masip et al., 2012) (e.g., food, monetary, token, or sensory information). For the neural control of movement, the actual reward assigned by the brain to any given action is rather an intrinsic and hidden quantity. For example, consider reaching for an ordinary object. It is difficult to predict what reward the brain actually assigns to such a task. Furthermore, if the object is a glass of water, the theory predicts that movement vigor should increase when the individual is thirsty, but exactly knowing the extent to which the intrinsic value of reward is modified seems tricky. Interestingly, the present methodology offers the possibility to robustly estimate and differentially compare such intrinsic rewards. Indeed, the reward attributed to the task can be inferred from the asymptotic value of G. Then it would be possible to record movements under different conditions and analyze how reward varies, all other things being equal.

However, time could be costly in motor control for reasons other than just a temporal discounting of reward. In particular, slow moves might also be undesirable because they monopolize a significant amount of neural resources due to the inevitable processing of the sensorimotor flow of information by the CNS, and neural processing of information is metabolically expensive (Laughlin, 2001; Lennie, 2003; Howarth et al., 2012). The attentional cost associated with slow-paced movement control could also augment and interfere with the proficiency of achieving other tasks, as suggested by dual-task experimental paradigms (Krampe et al., 2010). Hence, the signaling and processing of a quantity of information that likely grows with time during sensorimotor control might induce such a high cognitive load and large neural metabolic energy demand. Interestingly, a growing body of evidence also suggests that parkinsonian motor symptoms may be linked to neural metabolic deficits (Amano et al., 2014). The logistic shape identified for reaching may thus reflect this dual nature of the CoT. Indeed, its sole interpretation as a temporal discounting of reward might be insufficient to account for the inflection of G, and this secondary interpretation as a metabolic cost related to the engagement into a sensorimotor task may be speculated. This is what a recent study tends to confirm for planar arm reaching (Huang and Ahmed, 2012). A surprising constant metabolic cost related to the simple fact of being engaged in a reaching task was discovered. Therefore, the CoT we inferred here may also reflect this part of the metabolic cost that does not depend on the state of the musculoskeletal apparatus but on the underlying neural mechanisms that are involved in on-line movement control and that presumably depend on motion duration. An interesting topic for future research would consist of trying to disentangle these putative reward and metabolic parts of the CoT. This may be possible by tuning the task parameters adequately and by analyzing how the CoT varies accordingly (e.g., Klein-Flügge et al., 2015, where evidence for an inverse sigmoidal effort-related discounting of reward was reported).

It could still be argued that the CoT is just a mathematical epiphenomenon required to compensate for some missing features of the trajectory cost l. Besides a temporal discounting of reward, motion duration could indirectly reflect onto other costs, such as accuracy via constant noise accumulation (van Beers et al., 2004; van Beers, 2008) or metabolic energy expenditure (Alexander, 1997; Huang and Ahmed, 2012). For eye movement, it has been suggested that slow saccades are detrimental to movement accuracy as a consequence of constant noise that accumulates over time (van Beers, 2008). This would lead to a trajectory cost exhibiting a “U” shape as a function of MT because endpoint variability would not only increase at large speeds as a consequence of signal-dependent noise (Harris and Wolpert, 1998) but also at small speeds. This would yield optimal MTs without the need to resort to an explicit CoT. Although plausible for saccades and feedforward control, this view is unsustainable for visually guided reaching because sensory feedback is processed on-line to correct for motor errors. For reaching, this is rather the metabolic energy expenditure that may critically increase during slow-paced movement. Gravity could then be a predominant factor for MT formation. If so, MT should drastically increase in weightless conditions, but it is known that cosmonauts increase only slightly MT after adaptation to weightlessness (Papaxanthis et al., 1998). Moreover, criteria counting the effort required to counteract gravitational torques usually fail to predict realistic arm trajectories even in fixed time (Berret et al., 2011), and models minimizing the metabolic cost of muscle contraction had to resort to parameter tuning to be able to replicate empirical trajectories for a wide range of speeds, making it difficult to estimate metabolic energy expenditure across speeds (Alexander, 1997). Interestingly, the metabolic cost per arm movement reported in Huang and Ahmed (2012) exhibited a “U” shape with respect to motion speed, as for the case of walking (Ralston, 1958; Alexander, 1991), but with the difference that the subject's arm was fully supported by a robotic device. This observation further supports that the cost measured at rest or very low speed was mostly due to a metabolic energy expenditure associated with the fact of being actively involved into a reaching task (which would therefore depend on time only) rather than with gravity itself or with the complex energetics of the muscle contraction (e.g., Kushmerick, 2011). Unfortunately, it seems rather difficult from current knowledge to dissociate the part of metabolic energy expenditure that does depend on arm trajectories (and thus contributes to shaping them) from the one that does not depend on them but just on the passage of time.

Whatever the exact nature of this time cost is (temporal discounting of reward, neural/attentional metabolic cost or even something else, e.g., working memory decay), the present methodology allowed us to reliably infer its global shape for times corresponding to empirical reach and saccade durations. For reaching, the time interval approximately ranged from 300 to 1500 ms, whereas for saccades it ranged from ∼30 to 300 ms. Because the two CoTs exhibited different shapes, the question of their link can be posed. On the one hand, it seems conceivable to argue that the time costs underlying saccades and reaches are just independent because the two systems and their functions are different. On the other hand, it also seems possible to assume a single time cost. Indeed, the two CoTs may be juxtaposed as they only slightly overlap on their respective time intervals. Support for this idea was found when assuming zero duration at zero amplitude: in that case, the CoT for reaching had a shape reminiscent of the one for saccades in the vicinity of small times (typically <200 ms). However, such a similarity was only hypothetical because reach data were unavailable on such a range and whether or not the amplitude–duration relationship passed through the origin was undecidable on the present data. Actually, a regression intercept is often assumed in the literature (e.g., Fitts' law) (Young et al., 2009), but its relatively large value for reaching contrasts with the case of saccades. This discrepancy could be due to the role of feedback (which may require MT extensions to improve terminal reach accuracy) or to the incompressible duration of skeletal muscles activation/deactivation (which differ from extraocular muscles), making it impossible to attain zero duration as amplitude tends to zero. We nevertheless showed that this point was not critical regarding our main conclusion about the sigmoidal shape of the time cost over the range of real MTs. Further work, however, seems required to assess the extent to which temporal costs inferred from different motor tasks and systems (e.g., eye, wrist, arm, or whole-body) have compatible shapes on overlapping time intervals and whether or not they can be related.

Although the neural substrate of such a CoT is unknown, one may speculate that it is linked with the dopaminergic system because movement vigor is significantly altered in Parkinson's disease (Mazzoni et al., 2007). Indeed, dopamine plays a central role in the BG, a region related to the control of movement gain (extent and speed) and cost functions (Desmurget and Turner, 2008; Schmidt et al., 2008; Shadmehr and Krakauer, 2008; Turner and Desmurget, 2010; Amano et al., 2014). More specifically, a number of studies showed that transient inactivation or ablation of the globus pallidus internus, the principal output nucleus for the BG, reduced movement velocity and acceleration without altering hand path, reach accuracy, or movement sequencing (Anderson and Horak, 1985; Mink and Thach, 1991; Inase et al., 1996; Desmurget and Turner, 2010). In addition, the striatum could play a critical role in MT because it has been shown to be related to the temporal discounting of reward (Kobayashi and Schultz, 2008). Therefore, valuation of MT could be partly encoded in that area as it is a fundamental center to gauge effort-benefit situations (Croxson et al., 2009) even in the absence of extrinsic reward (Schouppe et al., 2014). It would be the purpose of future studies to investigate the neural correlates of time cost variations arising from task modifications. The present robust and automated methodology may offer such an opportunity by providing the genuine shape of CoT in neural motor control.

Footnotes

This work was supported by a public grant overseen by the French National Research Agency as part of the Investissement d'Avenir program, through the iCODE Institute project funded by the IDEX Paris-Saclay, ANR-11-IDEX-0003-02. We thank Thomas Deroche, Carole Castanier, and Jean Jeuvrey for help during the acquisition of the experimental data; and François Bonnetblanc, Francesco Nori, and Jérémie Gaveau for useful comments on earlier versions of the manuscript.

The authors declare no competing financial interests.

References

  1. Alexander RM. Energy-saving mechanisms in walking and running. J Exp Biol. 1991;160:55–69. doi: 10.1242/jeb.160.1.55. [DOI] [PubMed] [Google Scholar]
  2. Alexander RM. A minimum energy cost hypothesis for human arm trajectories. Biol Cybern. 1997;76:97–105. doi: 10.1007/s004220050324. [DOI] [PubMed] [Google Scholar]
  3. Amano S, Kegelmeyer D, Hong SL. Rethinking energy in parkinsonian motor symptoms: a potential role for neural metabolic deficits. Front Syst Neurosci. 2014;8:242. doi: 10.3389/fnsys.2014.00242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Anderson ME, Horak FB. Influence of the globus pallidus on arm movements in monkeys: III. Timing of movement-related information. J Neurophysiol. 1985;54:433–448. doi: 10.1152/jn.1985.54.2.433. [DOI] [PubMed] [Google Scholar]
  5. Baraduc P, Thobois S, Gan J, Broussolle E, Desmurget M. A common optimization principle for motor execution in healthy subjects and parkinsonian patients. J Neurosci. 2013;33:665–677. doi: 10.1523/JNEUROSCI.1482-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Battaglia PW, Schrater PR. Humans trade off viewing time and movement duration to improve visuomotor accuracy in a fast reaching task. J Neurosci. 2007;27:6984–6994. doi: 10.1523/JNEUROSCI.1309-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Ben-Itzhak S, Karniel A. Minimum acceleration criterion with constraints implies bang-bang control as an underlying principle for optimal trajectories of arm reaching movements. Neural Comput. 2008;20:779–812. doi: 10.1162/neco.2007.12-05-077. [DOI] [PubMed] [Google Scholar]
  8. Benson DA, Huntington GT, Thorvaldsen TP, Rao AV. Direct trajectory optimization and costate estimation via an orthogonal collocation method. J Guidance Control Dynamics. 2006;29:1435–1440. doi: 10.2514/1.20478. [DOI] [Google Scholar]
  9. Berardelli A, Rothwell JC, Thompson PD, Hallett M. Pathophysiology of bradykinesia in parkinson's disease. Brain. 2001;124:2131–2146. doi: 10.1093/brain/124.11.2131. [DOI] [PubMed] [Google Scholar]
  10. Berret B, Darlot C, Jean F, Pozzo T, Papaxanthis C, Gauthier JP. The inactivation principle: mathematical solutions minimizing the absolute work and biological implications for the planning of arm movements. PLoS Comput Biol. 2008;4:e1000194. doi: 10.1371/journal.pcbi.1000194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Berret B, Chiovetto E, Nori F, Pozzo T. Evidence for composite cost functions in arm movement planning: an inverse optimal control approach. PLoS Comput Biol. 2011;7:e1002183. doi: 10.1371/journal.pcbi.1002183. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Botzer L, Karniel A. A simple and accurate onset detection method for a measured bell-shaped speed profile. Front Neurosci. 2009;3:61. doi: 10.3389/neuro.20.002.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Brown SH, Hefter H, Mertens M, Freund HJ. Disturbances in human arm movement trajectory due to mild cerebellar dysfunction. J Neurol Neurosurg Psychiatry. 1990;53:306–313. doi: 10.1136/jnnp.53.4.306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Brücke C, Huebl J, Schönecker T, Neumann WJ, Yarrow K, Kupsch A, Blahak C, Lütjens G, Brown P, Krauss JK, Schneider GH, Kühn AA. Scaling of movement is related to pallidal gamma oscillations in patients with dystonia. J Neurosci. 2012;32:1008–1019. doi: 10.1523/JNEUROSCI.3860-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Collewijn H, Erkelens CJ, Steinman RM. Binocular co-ordination of human horizontal saccadic eye movements. J Physiol. 1988;404:157–182. doi: 10.1113/jphysiol.1988.sp017284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Croxson PL, Walton ME, O'Reilly JX, Behrens TE, Rushworth MF. Effort-based cost-benefit valuation and the human brain. J Neurosci. 2009;29:4531–4541. doi: 10.1523/JNEUROSCI.4515-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Dean M, Wu SW, Maloney LT. Trading off speed and accuracy in rapid, goal-directed movements. J Vis. 2007;7:10.1–1012. doi: 10.1167/7.5.10. [DOI] [PubMed] [Google Scholar]
  18. Desmurget M, Turner RS. Testing basal ganglia motor functions through reversible inactivations in the posterior internal globus pallidus. J Neurophysiol. 2008;99:1057–1076. doi: 10.1152/jn.01010.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Desmurget M, Turner RS. Motor sequences and the basal ganglia: kinematics, not habits. J Neurosci. 2010;30:7685–7690. doi: 10.1523/JNEUROSCI.0163-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Engelbrecht SE. Minimum principles in motor control. J Math Psychol. 2001;45:497–542. doi: 10.1006/jmps.2000.1295. [DOI] [PubMed] [Google Scholar]
  21. Ferrante A, Marro G, Ntogramatzidis L. A parameterization of the solutions of the finite-horizon LQ problem with general cost and boundary conditions. Automatica. 2005;41:1359–1366. doi: 10.1016/j.automatica.2005.01.018. [DOI] [Google Scholar]
  22. Fitts PM. The information capacity of the human motor system in controlling the amplitude of movement. J Exp Psychol. 1954;47:381–391. doi: 10.1037/h0055392. [DOI] [PubMed] [Google Scholar]
  23. Flash T, Hogan N. The coordination of arm movements: an experimentally confirmed mathematical model. J Neurosci. 1985;5:1688–1703. doi: 10.1523/JNEUROSCI.05-07-01688.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Garg D, Patterson MA, Hager WW, Rao AV, Benson DA, Huntington GT. A unified framework for the numerical solution of optimal control problems using pseudospectral methods. Automatica. 2010;46:1843–1851. doi: 10.1016/j.automatica.2010.06.048. [DOI] [Google Scholar]
  25. Gaveau J, Berret B, Demougeot L, Fadiga L, Pozzo T, Papaxanthis C. Energy-related optimal control accounts for gravitational load: comparing shoulder, elbow, and wrist rotations. J Neurophysiol. 2014;111:4–16. doi: 10.1152/jn.01029.2012. [DOI] [PubMed] [Google Scholar]
  26. Gentili R, Cahouet V, Papaxanthis C. Motor planning of arm movements is direction-dependent in the gravity field. Neuroscience. 2007;145:20–32. doi: 10.1016/j.neuroscience.2006.11.035. [DOI] [PubMed] [Google Scholar]
  27. Gill PE, Murray W, Saunders MA. Snopt: an SQP algorithm for large-scale constrained optimization. SIAM Rev. 2005;47:99–131. doi: 10.1137/S0036144504446096. [DOI] [Google Scholar]
  28. Gordon J, Ghilardi MF, Cooper SE, Ghez C. Accuracy of planar reaching movements: II. Systematic extent errors resulting from inertial anisotropy. Exp Brain Res. 1994;99:112–130. doi: 10.1007/BF00241416. [DOI] [PubMed] [Google Scholar]
  29. Gottlieb GL, Corcos DM, Agarwal GC. Strategies for the control of voluntary movements with one mechanical degree of freedom. Behav Brain Sci. 1989;12:189–250. doi: 10.1017/S0140525X00048238. [DOI] [Google Scholar]
  30. Guigon E, Baraduc P, Desmurget M. Computational motor control: redundancy and invariance. J Neurophysiol. 2007;97:331–347. doi: 10.1152/jn.00290.2006. [DOI] [PubMed] [Google Scholar]
  31. Guitart-Masip M, Chowdhury R, Sharot T, Dayan P, Duzel E, Dolan RJ. Action controls dopaminergic enhancement of reward representations. Proc Natl Acad Sci U S A. 2012;109:7511–7516. doi: 10.1073/pnas.1202229109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Haith AM, Reppert TR, Shadmehr R. Evidence for hyperbolic temporal discounting of reward in control of movements. J Neurosci. 2012;32:11727–11736. doi: 10.1523/JNEUROSCI.0424-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Harris CM, Wolpert DM. Signal-dependent noise determines motor planning. Nature. 1998;394:780–784. doi: 10.1038/29528. [DOI] [PubMed] [Google Scholar]
  34. Harris CM, Wolpert DM. The main sequence of saccades optimizes speed-accuracy trade-off. Biol Cybern. 2006;95:21–29. doi: 10.1007/s00422-006-0064-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Hoff B. A model of duration in normal and perturbed reaching movement. Biol Cybern. 1994:481–488. [Google Scholar]
  36. Hogan N. An organizing principle for a class of voluntary movements. J Neurosci. 1984;4:2745–2754. doi: 10.1523/JNEUROSCI.04-11-02745.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Howarth C, Gleeson P, Attwell D. Updated energy budgets for neural computation in the neocortex and cerebellum. J Cereb Blood Flow Metab. 2012;32:1222–1232. doi: 10.1038/jcbfm.2012.35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Huang HJ, Ahmed AA. Is there a reaching speed that minimizes metabolic cost?. Proceedings of the Translational and Computational Motor Control; 2012; New Orleans. 2012. [Google Scholar]
  39. Hudson TE, Maloney LT, Landy MS. Optimal compensation for temporal uncertainty in movement planning. PLoS Comput Biol. 2008;4:e1000130. doi: 10.1371/journal.pcbi.1000130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Huh D, Todorov E, Sejnowski TJ. Infinite horizon optimal control framework for goal directed movements. Society for Neuroscience Annual Meeting; 2010. Online: Program 492.411. [Google Scholar]
  41. Inase M, Buford JA, Anderson ME. Changes in the control of arm position, movement, and thalamic discharge during local inactivation in the globus pallidus of the monkey. J Neurophysiol. 1996;75:1087–1104. doi: 10.1152/jn.1996.75.3.1087. [DOI] [PubMed] [Google Scholar]
  42. Kappen HJ. Optimal control theory and the linear bellman equation. In: Barber D, Cemgil AT, Chiappa S, editors. Bayesian time series models. Cambridge: Cambridge UP Books Online; 2011. pp. 363–387. [Google Scholar]
  43. Kardamakis AA, Moschovakis AK. Optimal control of gaze shifts. J Neurosci. 2009;29:7723–7730. doi: 10.1523/JNEUROSCI.5518-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Kirk DE. Optimal control theory: an introduction. Upper Saddle River, NJ: Prentice-Hall; 1970. [Google Scholar]
  45. Klein-Flügge MC, Kennerley SW, Saraiva AC, Penny WD, Bestmann S. Behavioral modeling of human choices reveals dissociable effects of physical effort and temporal delay on reward devaluation. PLoS Comput Biol. 2015;11:e1004116. doi: 10.1371/journal.pcbi.1004116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kobayashi S, Schultz W. Influence of reward delays on responses of dopamine neurons. J Neurosci. 2008;28:7837–7846. doi: 10.1523/JNEUROSCI.1600-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Krampe RT, Doumas M, Lavrysen A, Rapp M. The costs of taking it slowly: fast and slow movement timing in older age. Psychol Aging. 2010;25:980–990. doi: 10.1037/a0020090. [DOI] [PubMed] [Google Scholar]
  48. Kushmerick MJ. Energetics of muscle contraction. Comprehensive physiology, supplement 27: handbook of physiology, skeletal muscle. 2011:189–236. doi: 10.1002/cphy.cp100107. First published in 1983. [DOI] [Google Scholar]
  49. Laughlin SB. Energy as a constraint on the coding and processing of sensory information. Curr Opin Neurobiol. 2001;11:475–480. doi: 10.1016/S0959-4388(00)00237-3. [DOI] [PubMed] [Google Scholar]
  50. Lee EB, Markus L. Foundations of optimal control theory. New York: John Wiley; 1967. [Google Scholar]
  51. Lennie P. The cost of cortical computation. Curr Biol. 2003;13:493–497. doi: 10.1016/S0960-9822(03)00135-0. [DOI] [PubMed] [Google Scholar]
  52. Liu D, Todorov E. Evidence for the flexible sensorimotor strategies predicted by optimal feedback control. J Neurosci. 2007;27:9354–9368. doi: 10.1523/JNEUROSCI.1110-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Mazzoni P, Hristova A, Krakauer JW. Why don't we move faster? Parkinson's disease, movement vigor, and implicit motivation. J Neurosci. 2007;27:7105–7116. doi: 10.1523/JNEUROSCI.0264-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Mink JW, Thach WT. Basal ganglia motor control: III. Pallidal ablation: normal reaction time, muscle cocontraction, and slow movement. J Neurophysiol. 1991;65:330–351. doi: 10.1152/jn.1991.65.2.330. [DOI] [PubMed] [Google Scholar]
  55. Mombaur K, Truong A, Laumond JP. From human to humanoid locomotion: an inverse optimal control approach. Autonomous Robots. 2010;28:369–383. doi: 10.1007/s10514-009-9170-7. [DOI] [Google Scholar]
  56. Myerson J, Green L. Discounting of delayed rewards: models of individual choice. J Exp Anal Behav. 1995;64:263–276. doi: 10.1901/jeab.1995.64-263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Nakano E, Imamizu H, Osu R, Uno Y, Gomi H, Yoshioka T, Kawato M. Quantitative examinations of internal representations for arm trajectory planning: minimum commanded torque change model. J Neurophysiol. 1999;81:2140–2155. doi: 10.1152/jn.1999.81.5.2140. [DOI] [PubMed] [Google Scholar]
  58. Nelson WL. Physical principles for economies of skilled movements. Biol Cybern. 1983;46:135–147. doi: 10.1007/BF00339982. [DOI] [PubMed] [Google Scholar]
  59. Niv Y, Daw ND, Joel D, Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology (Berl) 2007;191:507–520. doi: 10.1007/s00213-006-0502-4. [DOI] [PubMed] [Google Scholar]
  60. Nori F, Frezza R. Linear optimal control problems and quadratic cost functions estimation. 12th Mediterranean Conference on Control and Automation, MED'04; Kusadasi, Aydin, Turkey. 2004. [Google Scholar]
  61. Papaxanthis C, Pozzo T, Popov KE, McIntyre J. Hand trajectories of vertical arm movements in one-G and zero-G environments: evidence for a central representation of gravitational force. Exp Brain Res. 1998;120:496–502. doi: 10.1007/s002210050423. [DOI] [PubMed] [Google Scholar]
  62. Pontryagin LS, Boltyanskii VG, Gamkrelidze RV, Mishchenko EF. The mathematical theory of optimal processes. Oxford: Pergamon; 1964. [Google Scholar]
  63. Ralston HJ. Energy-speed relation and optimal speed during level walking. Int Z Angew Physiol. 1958;17:277–283. doi: 10.1007/BF00698754. [DOI] [PubMed] [Google Scholar]
  64. Rao AV, Benson DA, Darby CL, Patterson MA, Francolin C, Sanders I, Huntington GT. Algorithm 902: GPOPS, a MATLAB software for solving multiple-phase optimal control problems using the gauss pseudospectral method. ACM Trans Math Software. 2010;37:1–39. doi: 10.1145/1644001.1644009. [DOI] [Google Scholar]
  65. Richardson MJ, Flash T. Comparing smooth arm movements with the two-thirds power law and the related segmented-control hypothesis. J Neurosci. 2002;22:8201–8211. doi: 10.1523/JNEUROSCI.22-18-08201.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Rigoux L, Guigon E. A model of reward- and effort-based optimal decision making and motor control. PLoS Comput Biol. 2012;8:e1002716. doi: 10.1371/journal.pcbi.1002716. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Schmidt L, d'Arc BF, Lafargue G, Galanaud D, Czernecki V, Grabli D, Schüpbach M, Hartmann A, Lévy R, Dubois B, Pessiglione M. Disconnecting force from money: effects of basal ganglia damage on incentive motivation. Brain. 2008;131:1303–1310. doi: 10.1093/brain/awn045. [DOI] [PubMed] [Google Scholar]
  68. Schouppe N, Demanet J, Boehler CN, Ridderinkhof KR, Notebaert W. The role of the striatum in effort-based decision-making in the absence of reward. J Neurosci. 2014;34:2148–2154. doi: 10.1523/JNEUROSCI.1214-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Scott SH. Optimal feedback control and the neural basis of volitional motor control. Nat Rev Neurosci. 2004;5:532–546. doi: 10.1038/nrn1427. [DOI] [PubMed] [Google Scholar]
  70. Shadmehr R. Control of movements and temporal discounting of reward. Curr Opin Neurobiol. 2010;20:726–730. doi: 10.1016/j.conb.2010.08.017. [DOI] [PubMed] [Google Scholar]
  71. Shadmehr R, Krakauer JW. A computational neuroanatomy for motor control. Exp Brain Res. 2008;185:359–381. doi: 10.1007/s00221-008-1280-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Shadmehr R, Mussa-Ivaldi S. Biological learning and control. Cambridge, MA: Massachusetts Institute of Technology; 2012. [Google Scholar]
  73. Shadmehr R, Orban de Xivry JJ, Xu-Wilson M, Shih TY. Temporal discounting of reward and the cost of time in motor control. J Neurosci. 2010;30:10507–10516. doi: 10.1523/JNEUROSCI.1343-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Stengel R. Dover books on advanced mathematics. New York: Dover; 1986. Optimal control and estimation. [Google Scholar]
  75. Sutton RS, Barto AG. Reinforcement learning: an introduction. Cambridge, MA: Massachusetts Institute of Technology; 1998. [Google Scholar]
  76. Tanaka H, Krakauer JW, Qian N. An optimization principle for determining movement duration. J Neurophysiol. 2006;95:3875–3886. doi: 10.1152/jn.00751.2005. [DOI] [PubMed] [Google Scholar]
  77. Todorov E. Optimality principles in sensorimotor control. Nat Neurosci. 2004;7:907–915. doi: 10.1038/nn1309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Todorov E. Optimal control theory. In: Doya K, editor. Bayesian brain: probabilistic approaches to neural coding. Cambridge: MIT; 2006. pp. 269–298. [Google Scholar]
  79. Todorov E, Jordan MI. Optimal feedback control as a theory of motor coordination. Nat Neurosci. 2002;5:1226–1235. doi: 10.1038/nn963. [DOI] [PubMed] [Google Scholar]
  80. Turner RS, Desmurget M. Basal ganglia contributions to motor control: a vigorous tutor. Curr Opin Neurobiol. 2010;20:704–716. doi: 10.1016/j.conb.2010.08.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Uno Y, Kawato M, Suzuki R. Formation and control of optimal trajectory in human multijoint arm movement: minimum torque-change model. Biol Cybern. 1989;61:89–101. doi: 10.1007/BF00204593. [DOI] [PubMed] [Google Scholar]
  82. van Beers RJ. Saccadic eye movements minimize the consequences of motor noise. PLoS One. 2008;3:e2070. doi: 10.1371/journal.pone.0002070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. van Beers RJ, Haggard P, Wolpert DM. The role of execution noise in movement variability. J Neurophysiol. 2004;91:1050–1063. doi: 10.1152/jn.00652.2003. [DOI] [PubMed] [Google Scholar]
  84. Van der Helm FCT, Rozendaal LA. Musculoskeletal systems with intrinsic and proprioceptive feedback. In: Winters JM, editor. Biomechanics and neural control of posture and movement. New York: Springer; 2000. pp. 164–174. [Google Scholar]
  85. Winter D. Biomechanics and motor control of human movement. New York: Wiley; 1990. [Google Scholar]
  86. Wolpert DM, Landy MS. Motor control is decision-making. Curr Opin Neurobiol. 2012;22:996–1003. doi: 10.1016/j.conb.2012.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Young SJ, Pratt J, Chau T. Target-directed movements at a comfortable pace: movement duration and Fitts's law. J Mot Behav. 2009;41:339–346. doi: 10.3200/JMBR.41.4.339-346. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES