Abstract
Studies of reaching suggest that humans adapt to novel arm dynamics by building internal models that transform planned sensory states of the limb, e.g., desired limb position and its derivatives, into motor commands, e.g., joint torques. Earlier work modeled this computation via a population of basis elements and used system identification techniques to estimate the tuning properties of the bases from the patterns of generalization. Here we hypothesized that the neural representation of planned sensory states in the internal model might resemble the signals from the peripheral sensors. These sensors normally encode the limb's actual sensory state in which movement errors occurred. We developed a set of equations based on properties of muscle spindles that estimated spindle discharge as a function of the limb's state during reaching and drawing of circles. We then implemented a simulation of a two-link arm that learned to move in various force fields using these spindle-like bases. The system produced a pattern of adaptation and generalization that accounted for a wide range of previously reported behavioral results. In particular, the bases showed gain-field interactions between encoding of limb position and velocity, very similar to the gain fields inferred from behavioral studies. The poor sensitivity of the bases to limb acceleration predicted behavioral results that were confirmed by experiment. We suggest that the internal model of limb dynamics is computed by the brain with neurons that encode the state of the limb in a manner similar to that expected of muscle spindle afferents.
Keywords: reaching, arm movements, adaptation, force fields, computational models, motor control, motor learning
1. Introduction
To generate reaching movements, the brain appears to compute motor commands based on predicted dynamics of the task (Gottlieb, 1994;Shadmehr and Mussa-Ivaldi, 1994). When the dynamics change unexpectedly, movements exhibit errors. The errors produce unexpected changes in muscle lengths that are transduced by proprioceptive sensors, which in turn evoke short- and long-latency responses that modify the ongoing motor commands. Because of inherent delays in the nervous system, this feedback reduces the error in the ongoing movement but does not eliminate it. To eliminate the error, the brain use the error-driven motor response in one trial to modify the motor commands that initiate the next trial (Thoroughman and Shadmehr, 1999). In theory then, the process of learning the control of limb dynamics is an example of “error-feedback-learning”: reflex circuits transduce the motor error and generate corrective motor commands. These corrections act as the teaching signal for an internal model that computes the subsequent voluntary motor commands (Kawato, 1989; Marsden et al., 1976). The internal model in this case is a sensorimotor map that transforms a desired action (represented in kinematic coordinates) into motor commands (represented as torques), and adaptation is due to supervised learning.
In supervised learning, error signals primarily affect the states in which errors are experienced. However, depending on how states are represented, the effect of error may be local to the state or generalize broadly to neighboring states. For example, if we imagine an internal model as a system that transforms the desired limb position and its derivatives into an estimate of joint torques (Conditt et al., 1997) via a weighted combination of basis elements (Donchin et al., 2003; Thoroughman and Shadmehr, 2000):
(1) |
Then each basis gi encodes some aspect of the limb's sensory state and a population code forms a motor command. In this theoretical framework, the tuning of bases determines generalization. For example, broad tuning with respect to limb position θd predicts that after one experiences error during reaching in one workspace of the hand (say with the shoulder in a flexed posture), the error will generalize to reaches in another workspace far away (with the shoulder in an extended posture) (Shadmehr and Moussavi, 2000). Indeed, this wide pattern of generalization in terms of limb position is a consistent property of adaptation to novel arm dynamics (Malfait et al., 2002).
If the internal model can be viewed as a population coding via a set of bases, then it is important to predict how the bases encode limb state and how that encoding translates into neuronal discharge. Thus far, the approach has been to measure behavior and then try to find tuning functions that produce generalization patterns that are consistent with the measured data. For example, a previous study suggested that in order to account for the observed patterns of generalization in force fields that depended on both position and velocity of the limb, the bases likely encoded these variables multiplicatively: the bases were directionally tuned, but the depth of tuning was linearly modulated with respect to limb position (Hwang et al., 2003). The result predicted that the bases encoded these two variables in a “gain-field”.
Remarkably, a recent paper found such gain-field coding in the tuning of some cells in the motor cortex (Paninski et al., 2004). Earlier, Bizzi and colleagues had reported that during adaptation, changes in the directional tuning of motor cortical cells paralleled changes in the spatial tuning of task related arm muscles (Li et al., 2001). However, because some cells maintained these changes in tuning even after a washout period, it seemed likely that the motor cortex was involved both in the formation of the internal model and its expression. Therefore, the idea of using patterns of generalization to infer shape of the bases and then testing the predictions in the motor cortex appeared promising.
Unfortunately, there are inherent limitations in this approach. In principle, the bases encode every aspect of limb state, including limb acceleration and possibly higher derivatives. The large state space makes it impossible to design psychophysical experiments that can probe generalization patterns along each dimension of state space.
Here we approached the problem from a different direction. Rather than trying to predict tuning of the bases from generalization patterns in behavioral data, we used a physiologically derived prior assumption on the shape of the bases to predict behavior. If reflex based error-feedback acts as a teaching signal to an internal model, then the states in which the errors are experienced are reported to the central nervous system (CNS) in proprioceptive coordinates. Therefore, we hypothesized that the bases that form the internal model code the desired state of the limb with functions that are similar to the coding found in the neural discharge of muscle-spindle afferents.
This assumption imposes strong constraints on the shape of the bases: the discharge of spindle afferents is a scalar function that nonlinearly encodes all aspects of limb state, including position, velocity, acceleration, and even higher derivatives (Hasan, 1983). Here we simulated adaptive systems that used these bases to learn dynamics of reaching. We compared generalization properties of the system with behavioral data and found that a number of psychophysical results could be accounted for by this new model.
2 Model
2.1 Encoding state of the limb in the peripheral nervous system
Several studies have proposed mathematical models of spindle afferent discharge (Hasan, 1983; Houk et al., 1981; Lin and Crago, 2002; Poppele and Boyd, 1970; Prochazka and Gorassini, 1998; Schaafsma et al., 1991). However, one class of models stands out because first, its components are based on physical properties of muscle fibers (i.e., length-tension relationship), and second, the model can explain neuronal discharge during active motion of freely moving animals (Hasan, 1983; Prochazka and Gorassini, 1998a). In this model (Hasan, 1983), spindle afferent discharge is determined by the impact of the state of the muscle on two physically different zones of the muscle spindle: sensory and non-sensory. These two regions are in series. We label the length of the sensory zone z(t) and the length of the non-sensory zone y(t). Because the zones are in series, the tensions are equal. For the non-sensory zone, tension f(t) is a nonlinear function of y(t) and its first derivative , as shown in Eq. 2. For the sensory zone, tension is assumed to be a simple spring, with the tension proportional to the length of the sensory zone (Eq. 3).
(2) |
(3) |
Here, k1 and k2 are length sensitivities for the non-sensory and sensory zones respectively. Length of a spindle is the sum of the lengths of the sensory and non-sensory zones: x(t) = y(t) + z(t). Substituting this equality into Eqs. 2 and 3, after re-arrangement we arrive at a differential equation that describes the relationship between total muscle spindle length x(t) and the sensory zone length z(t):
(4) |
where b = (k1 + k2)/k1. Parameter a is the sensitivity of tension to velocity in the non-sensory zone and parameter b is the ratio of the sum of length sensitivities for both zones and the length sensitivity for the non-sensory zone of the spindle. Parameter c determines the zero length tension and influences the background firing rate.
The receptor potential of the spindle afferent neuron is linearly proportional to the sensory zone length z(t) while the firing rate of the spindle afferent neuron, represented by g(t), is roughly proportional to a linear combination of its potential and its first derivative (Eq. 5). In Hasan's model, the time constant 0.1 was approximated to fit the experimental results (Hasan, 1983).
(5) |
From Eq. 4 we see that at any given time, the sensory zone length z(t) depends on the history of spindle length changes -- not just the current length and velocity of the spindle. Thus, if we define the state of the limb as its position and velocity, we see that the spindle discharge at a given “state” will differ as a function of the immediately preceding history by which the limb arrived at that state. For example, although the length and velocity are held at constant values at the end of stretching, the spindle discharge continues to change, converging to the steady state value.
A wide range of spindle afferent discharge characteristics during both passive and active movements can be reproduced by varying the three parameters, a, b, and c in Eq. 4. For active voluntary movements, Hasan (1983) suggested a=100, b=100 and c=−25 for the static-γ neuron activated primary spindle afferents and a=0.1, b=250, c=−15 for the dynamic-γ neuron activated primary spindle afferents. Because we were concerned with modeling voluntary movements, we used these two sets of parameter values to represent the physiologic range. In comparison, for passive movements the term a is about 50% smaller for dynamic-γ and 50% larger for static-γ neuron activated primary spindle afferents.
Note that these parameter values are kept constant during movement. However, γ motoneuron activity changes during a voluntary movement. How does this affect the ability of the model to account for the time course of spindle discharge? For example, a fast movement involves large changes in α motoneuron activity, with correspondingly large changes in γ motoneuron discharge. In contrast, in slow movements one would expect smaller changes in γ motor neuron discharge. Prochazka and Gorassini (1998a) tested the current model, as well as five other models, against the discharge histories of a population of spindle afferents recorded from the hamstring muscle of freely moving cats walking at slow, medium, and fast speeds. Remarkably, this model with its parameters fixed at constant values accounted for the temporal profile of neural discharge exceedingly well in various speeds (with r-squares of 84%, 85%, and 88% for slow, medium, and fast speeds). Prochazka and Gorassini (1998a) noted that “predictions were not improved by adding components of firing rates that mimicked EMG-linked γ motor action.”
2.2 Spindle-like basis set
Eqs. 4 and 5 provide an approximation of how the peripheral nervous system might encode the kinematic state of a muscle during a voluntary movement. Here our intention is to model the encoding of state of an entire arm during reaching movements. To compute the response for a whole arm movement, it is necessary to transform the multi-dimensional movement trajectory to one-dimensional spindle-length history. For a given point-to-point reaching movement, the change of muscle length varies depending on which joint the muscle crosses over. For example, a muscle spindle embedded in the anterior deltoid that crosses over the shoulder joint would respond mostly to shoulder extension whereas a muscle spindle embedded in the brachioradialis that crosses over the elbow joint would respond mostly to elbow extension. Thus, assigning the joint over which the spindle crosses is equivalent to assigning a preferred direction to each muscle in joint space (Fig. 1).
Length changes in a muscle relate to changes in joint angles via a Jacobian matrix that in turn specifies a moment arm. Although this matrix is a function of joint angles, for small amplitude reaching movements the matrix is approximately constant. Eq. 6 describes a simple linearized process to transform a 2D joint displacement to changes in muscle spindle length (Bhushan and Shadmehr, 1999).
(6) |
θ is a 2×1 vector consisting of shoulder and elbow joint angles. θ0 is the resting joint position, we used θ0 = [1.1 2.0]T radian corresponding to shoulder and elbow angles of 63° and 115° (Fig. 1a), respectively, with a hand position that is 20 cm left and 30 cm in front of the right shoulder point. However, the value for θ0 does not affect the simulation results presented in this paper. θpref is a 2×1 unit vector representing a preferred direction in joint angle space. For example, θpref = [1 0]T corresponds to 0° preferred direction which represents pure shoulder flexion while θpref = [0.7 0.7]T corresponds to 45° preferred direction which responds equally to both shoulder and elbow flexions. The inner product of θpref and Δθ computes the change of joint angle in the preferred direction, reducing the dimension from 2D to 1D. Finally, multiplying the inner product by λ , the moment arm, transforms the joint angle into the muscle spindle length (Fig. 1b).
To calculate x(t) for a point-to-point movement, we computed a minimum jerk trajectory. That is, for a movement from point p1 to point p2 the trajectory, p(t), was calculated using the following equation, p(t) = p1 + (p2−p1){6(t/T)5 − 15(t/T)4 + 10(t/T)3} where t is time and T is the total movement time. This trajectory was translated into a joint angle trajectory using a 2D kinematic transformation and finally converted the joint angle trajectory into the spindle length profile according to Eq. 6. For a given x(t), we numerically solved the differential equation in Eq. 4 using forward-Euler method (step size = 1 ms, implemented in Matlab) to obtain the firing rate as a function of time for each basis element.
The important parameters that determine the tuning property of a spindle-like basis element are a, b, c, λ , and θpref in Eqs. 3-5. As noted earlier, we used the same values that Hasan (1983) suggested for static and dynamic-γ neuron activated primary endings for parameters a, b, and c. The parameters suggested for the secondary endings were not included because its tuning property was very similar to that of static-γ neuron activated primary endings. For the moment arm λ , there are many possible values as many arm muscles with various lengths are involved. In order to cover the whole range of possible moment arms, we chose two extreme values, the smallest (8 mm) and the largest (80 mm) in the moment arm distribution reported in Murray et al. (1995) Note that the unit for the moment arm is in mm. The preferred direction θpref was uniformly distributed in the joint displacement space at π / 8 intervals. Thus the total number of bases (g ) is 2 (two sets of a, b, c) × 2 (two λ 's ) × 16 ( sixteen θpref 's ) = 64. Table 1 summarizes values of the five parameters for all 64 basis elements that we used for simulations in this paper.
Table 1.
a | b | c | λ | θ pref ,j ( j = 0, 1, 2,.., 15 ) | |
---|---|---|---|---|---|
g1,j | 100 | 100 | −25 | 80 | |
g2,j | 100 | 100 | −25 | 8 | |
g3,j | 0.1 | 250 | −15 | 80 | |
g4,j | 0.1 | 250 | −15 | 8 |
3. Results
3.1 Tuning functions
In order to examine the tuning properties of the spindle-like basis elements, we simulated standard center-out reaching tasks and computed the activities of basis elements during reaching.
3.1.1 Task
Figure 2a shows the spatial configuration for the reaching simulation. The center-out reaching was made on a horizontal plane at shoulder level. Nine different center positions were located in a 3×3 grid (9 grey dots). The grey dot in the middle was aligned with the body midline and 43 cm away from the center of the head. At each center position, 10 cm reaching was made in 8 different directions (8 arrows). Initially, the arm stayed at the center location and moved to the target in 500 ms. The trajectory of each reaching was assumed to follow a minimum jerk profile. For a given point-to-point reaching, the change of spindle lengths and the corresponding discharge rate was calculated as described in section 2.2.
3.1.2 Tuning curves
Figure 2b shows the time course of response for an example basis element with a=100, b=100, c=−25, λ =80, and θpref = [cos(π / 8) sin(π / 8)]T during reaching from a middle center position in 8 different directions. These parameters model a typical static-γ neuron activated primary spindle afferent. The preferred direction π/8 in the joint angle space corresponds to mostly shoulder flexion, thus, the most active direction is down and left in this arm configuration. When the arm moves in the preferred direction of this basis, the response is strongly influenced by hand velocity. However, note the small bump in the response at the beginning, when the hand acceleration in the preferred direction is positive, and the small dip in the response at the end, when the acceleration is negative. These correlations of the basis response are not limited to the hand velocity or acceleration because joint angular velocity and acceleration are approximately linearly related to the hand velocity and acceleration in small amplitude movements. We found that the combined sensitivity to velocity and acceleration produced a response in the simulated spindles that was quite similar to the spindle response recorded during wrist motion (Jones et al., 2001) as well as the discharge of some cells recorded in the somatosensory cortex (area 3a) and motor cortex during foot motion (Wise and Tanji, 1981).
The octagon inside of the dashed rectangle in Fig. 2c displays the data in Fig. 2b as a tuning function. The size of the filled circle represents the mean firing rate during the center-hold period and the distance from the center to each vertex represents the mean firing rate while moving in that direction. To display the effect of limb position on the activity of the bases, the eight other sub-plots in Fig. 2c plot the tuning of this basis when the starting location for the reaching movement was shifted to each of the other 8 positions. The activity during the center-hold period monotonically changes as a function of starting location. However, note that the interaction of position sensitivity and directional tuning is multiplicative since the size of the circles and octagons increase concomitantly. Therefore, the simulations predict that a spindle-like basis will have a tuning curve similar to that of a gain-field.
Figures 2d-e show the tuning properties of a spindle-like basis with a=0.1, b=250, c=−15, and λ =80. These parameters model a typical primary spindle afferent activated by a dynamic-γ motor neuron. This basis displays relatively little sensitivity to changes in the starting position of the reach, but strong sensitivity to hand velocity. Regardless, this basis also shows a gain-field like interaction between position and velocity. Whereas previously we had used psychophysical data during learning to argue for necessity of gain-field coding (Hwang et al., 2003), the current results show that this kind of coding may be a reasonable prior assumption on the tuning of the bases. For bases with a smaller moment arm (k=8), arm position modulates directional tuning by a smaller amount and the tuning remains relatively constant across the workspace.
3.2 Adaptation to a velocity dependent curl field
People tend to reach in roughly a straight path from one point to another with a bell shaped velocity profile (Flash and Hogan, 1985; Morasso, 1981). This tendency continues even when their reaching movements are perturbed by an external force field or by modification of arm inertial properties (Krakauer et al., 1999; Lackner and Dizio, 1994; Shadmehr and Mussa-Ivaldi, 1994). A well studied adaptation protocol is for reaching in velocity dependent curl fields that perturb the hand in the direction perpendicular to the movement. We began by asking if the internal model with the spindle-like bases could adapt to the velocity dependent curl field.
3.2.1 Task
The simulated movement was 10 cm long from the center of the horizontal workspace (the grey dot in the middle of 3×3 grid in Figure 2a) towards the body during 500 ms. The magnitude of the external force was proportional to the hand velocity and the direction was perpendicular to the instantaneous velocity, v(t). That is, where is a 2×1 force vector and is a 2×1 velocity vector.
3.2.2 Representation of the internal model and simulation of arm dynamics
We assumed that the internal model is represented as a population coding via a set of basis elements that have the response function as shown in equation 4. The internal model is:
(7) |
where is the expected environmental torque, gi is a firing rate of each basis element, and wi is a torque vector composed of shoulder torque and elbow torque, corresponding to each basis element. To simulate human arm reaching, we used a simple 2-link system that described the physics of our experimental setup (Shadmehr and Mussa-Ivaldi, 1994).
3.2.3 Adaptation of the internal model
In our simulation, the adaptation of the internal model was implemented by updating weights after each movement so that the squared error of the output from the internal model decreased. The update follows the gradient descent rule, . Here, Δwi is the amount of weight change for the i-th basis element, η is a learning rate, and τenv is the actual environmental torque.
3.2.4 Simulation results
Figure 3 displays the simulated hand paths in the early and late phases of training. In the beginning, as the internal model expects zero external (environmental) forces, large movement errors are produced in the field, whereas almost no errors are observed in the null field condition (Fig. 3a). Figure 3c displays the actual external forces and the predicted external forces by the internal model after 200 movements of training. The correlation coefficient between the actual and predicted forces is 0.98. As the predicted forces come to match the actual external forces closely, the movement errors decrease (Fig. 3b). In parallel, errors during catch trials (i.e., trials in which the field is unexpectedly turned off) increase with training. Therefore, the bases can readily approximate a velocity dependent force pattern.
3.3 Transfer of adaptation from reaching to drawing of circles
Mussa-Ivaldi and colleagues (Conditt et al., 1997) observed that when people adapt their reaching in a force field, the internal model that they acquire generalizes to other tasks like drawing of circles. Here, we tested if our spindle-like basis model could reproduce such experimental result.
3.3.1. Task
In their key experiment, they considered two groups of subjects: one group practiced 10 cm point-to-point reaching to 8 directions in a curl field (typically 0.5 s long), and another group trained by drawing circles in the same field (typically 2 s long). The eight directional point-to-point movements spanned a range of 360°. The configuration of reaching and circular movements is displayed in Fig.4a. Mussa-Ivaldi and colleagues noted that the amplitude and the temporal speed of the circular movements were chosen so that the circles passed through the same positions and velocities as the reaching movements.
They then asked the “reach” subjects to draw circles and compared their performance in circular movements to the performance of the group who were originally trained in circular movements. In circle drawing trials, a template of the circle was provided to subjects on the monitor. While the states (e.g., hand position and velocity) were similar in the reach and circle-drawing tasks, the order in which the states were visited and the time-scale of the movements in the two experiments were very different.
Fig. 4b plots some of the trajectories that they recorded in their experiments. They found that when subjects were asked to draw circles, the “reach” group produced circles that looked very similar to the well-trained “circle” group (Fig. 4b-5 and 6). Similarly, when asked to draw a circle, they found that if they turned off the field the trajectoris of the “reach” group looked very similar to the after-effects recorded from the well-trained “circle” group (Fig. 4b-2 and 3). Thus, reaching resulted in a representation that allowed the subjects to draw circles in the field nearly as well as subjects that had trained in circles.
3.2.2 Simulation results
We simulated this experiment with the same bases described above Figs. 4c-1 and 4 show the trajectories produced by the “naïve” model in a null field and in the force field, respectively. The naïve system produced a fairly accurate circle in the null field (Fig. 4c-1) but a distorted circle in the force field (Fig. 4c-4). The similarity in distortions between model system (Fig. 4c-4) and those recorded from naïve subjects (Fig. 4b-4) suggests that the model represented the task's dynamics reasonably well.
We next trained the model to draw circles in the force field (Fig. 4c-5). When the field was removed, the resulting direct after-effect (Fig. 4c-2) was similar to those recorded from subjects (Fig. 4b-2). Finally, we trained the model to reach to 8 target directions in a curl field (movement period was 0.5 seconds, 192 trials) and then fixed the model weights and drew a circle (6.25 cm radius, period of 2 seconds) in the same field. Fig. 4c-6 shows that the resulting trajectory in the field and Fig. 4c-3 shows the resulting transferred after-effect. We observed significant transfer from the reaching to the circle trials both in terms of similarity of after-effects in the direct and transferred conditions (Fig. 4c-2 and 3), and improvement in ability to draw a circle in the field (Fig. 4c-5 and 6). Over all, the model showed much of the transfer characteristics in reaching and circle drawings reported by Conditt et al. (1997).
3.4 Adaptation and generalization in fields that depend on both position and velocity
Spindle-like bases exhibit a gain-field property, i.e., directional tuning is modulated as a function of limb position (Fig. 2). Basis functions with gain-field properties are thought to be able to account for a series of human psychophysical experimental data (Hwang et al., 2003). Here, we tested if our spindle-like basis model could reproduce such experimental data.
3.4.1 Experimental methods and results
Figure 5a displays the experimental setup. Subjects made movements to the same direction but from three different starting locations, separated by distance d in a horizontal plane. These three different starting locations were randomly intermixed at an equal probability throughout the experiment. Since the direction of these three movements is constant, the joint angular velocity profile of the three movements is identical. A velocity dependent curl force field similar to that in the previous section was applied to the three movements. However, in this experiment, the viscous field was a function of the start location. The coefficient matrices were , and for the left, center and right locations, respectively. The performance of subjects was evaluated by measuring the perpendicular error (p.e.), which is the perpendicular distance from the straight line at 250 ms into the movement. Then, learning index was calculated using . Note that we used the average perpendicular errors across all the catch trials and field trials in the last 84 movement bin for the learning index reported here. When the subject learns this force field completely, the learning index becomes one as the perpendicular error in the field trials becomes zero. The result was that performance in this field improved as the separation distance increased (Fig. 5b, redrawn from Hwang et al. 2003a).
3.4.2 Simulation results
The same 64 spindle-like basis functions that were used in the previous simulation were used here. The subplots in Figure 5c display trial-by-trial perpendicular errors produced by the simulation when the separation distance was 12 cm and 0.5 cm, respectively. Note the increased “noise” in the center movement when d was small. This noise is a result of generalization of errors from movements at “left” and “right” to the “center” movement. A very similar pattern of increased noise was observed in reaching of volunteers (Hwang et al., 2003).
As in the actual experiment, at d=12cm errors for the left and right simulated movements gradually decreased with training. On the other hand, at 0.5cm, movement errors remained almost constant throughout training, showing no adaptation. We quantified the learning performance of simulation for each separation distance using learning index. Figure 5b shows that the learning index from the simulation closely matched the previously reported experimental data (Hwang et al., 2003).
There are two key properties in the bases that account for the specific pattern of generalization shown in Fig. 5b. One property is the multiplicative interaction between position and velocity encodings in the basis elements. As the force field in these simulations is a nonlinear function of limb position and velocity, only bases with a nonlinear interaction can properly approximate the force field. The other property is the degree of position sensitivity of bases. About half of our bases had relatively high position sensitivities because their moment arms were long (λ =80 mm), allowing the force output to be monotonically modulated at a large separation distance. The steeper the average gain of the gain field among bases, the easier it is for the system to learn at smaller d. If the average moment arm that we used to build the model was longer (or shorter), the simulated learning would be better (or worse) than the learning that we had observed in our volunteers.
3.5 Generalization of viscous field learning across a large workspace
Shadmehr and Moussavi (2000) described an experiment where subjects trained in a velocity dependent curl field in one workspace (hand to the far left) and were then tested in a different workspace (hand to the far right). We simulated adaptation in this task using the same bases described earlier.
3.5.1 Experimental methods and results
In one group, subjects were trained to reach to 8 directions in a velocity dependent force field at the left workspace (shoulder flexed position) and then tested to reach to 8 directions in the same field at the right workspace (shoulder extended position). The distance between these two positions was about 80 cm. The performance of these subjects was compared to the performance of naïve subjects who were tested in the same force field at the right workspace without prior training at the left workspace. Figure 6a shows the performance from these two groups, using perpendicular errors. At the right workspace, the performance of the group with the prior training in the left workspace is significantly better than the performance of the naïve group, suggesting generalization of the force field learning across a large workspace.
3.5.1 Simulation results
The model was trained to reach to targets at the left workspace and then tested at the right workspace, 80 cm away from the training location. Figure 6b shows errors in the left and right workspaces for the transfer and naïve conditions in the same format as the experimental data. The model showed transfer across a large workspace, reproducing the experimental result.
Note that in this experiment, the bases generalized very broadly across the workspace. Yet, in the results presented above, the same bases under a different training protocol displayed local spatial generalization, allowing adaptation to take place when opposite fields were present at a distance of 12 cm. What property of the basis set allows such flexibility? The important property is the sensitivity of directional tuning to changes in limb position. When the model is trained in a velocity dependent field at the left workspace, all the bases with relatively large activity at the left workspace contribute to adaptation by changing their corresponding weight vectors. However, this change of weight will be generalized only if the activity of the bases does not decrease significantly at the right workspace. Thus, the observed transfer in the simulation was due to the bases that maintained relatively high activity throughout the workspace. These bases had a gain field that was gentle (λ =8 mm). In contrast, these bases with gentle gain fields did not contribute to the adaptation process shown in Fig. 5 because their activity was almost constant at the left, center and right locations, causing the corresponding weight vectors to adapt to nearly zero, which was the average force across the three locations.
These simulations predict that among the neurons that represent the internal model (perhaps in the motor cortex), adaptation in the task of Fig. 5 produces changes in tuning among mostly the cells that have steep gain fields, whereas adaptation in the task of Fig. 6 will produce changes among cells that have gentle gain fields. The proportion of steep to gentle gain field cells is the most important factor that determines how close two movements can be before adaptation to opposite forces becomes difficult.
3.6 Adaptation to an acceleration dependent curl field
In our theoretical framework, the adaptation ability of the system depends on the shape of the bases (Atkeson, 1989). The output of the spindle-like bases during a point-to-point movement is highly correlated with the velocity profile but is less representative of limb acceleration (Fig. 2a). Thus, our simulated adaptive system will adapt more easily to a velocity dependent field than an acceleration dependent one.
3.6.1 Model prediction
To demonstrate this idea, we simulated adaptation to an acceleration dependent curl field, where is a 2×1 acceleration vector. The movements were 15 cm long, straight toward the body in 550 ms. Figure 7a and b show simulated hand paths from the early and late training phases. The acceleration dependent curl field switches the direction of perturbation in the middle of a movement as the hand starts decelerating. For example, for the downward movement in Figure 7a, the force pushes the hand to the right while the hand accelerates in the first half of the movement and starts pulling the hand back to the left once the hand decelerates. Because of this change of force direction, this force field causes a large spiral movement when the internal model expects zero external force (Figure 7a). However, with training, the internal model adapts to approximate the external force (correlation coefficient = 0.87). Figure 7c shows that the internal model learned to produce a force profile that was correlated to the external force. Accordingly, simulated movement in the late training phase became straighter and produced a smaller spiral.
However, the approximated force for the acceleration dependent field was not as good as the approximated force for the velocity dependent field. This discrepancy occurs despite the fact that training in the acceleration-dependent field (480 trials) lasted more than twice as long as in the velocity dependent field (200 trials). Therefore, although an internal model with spindle-like bases could adapt to an acceleration dependent force, the model predicts that the adaptation rate will be much slower than in a velocity dependent field.
3.6.2 Experimental methods
We tested this prediction by training six subjects to adapt to both a velocity dependent curl field, , and an acceleration dependent curl field, . The sizes of the matrices were chosen so that for a typical movement, the peak force was similar. Subjects repeatedly made 15 cm movements from the center of the workspace toward a single target in two different sessions; one in a velocity dependent force field and the other in a acceleration dependent force field. We counter-balanced the order of training and compared the adaptation rates.
In order to quantify the degree of adaptation, we measured the force that the subject produced on the handle in a small number of trials where the robot produced a force channel (Scheidt et al., 2000). The robot creates a force channel by exerting a force on the hand of the subject that is equal and opposite to the component of the force the subject exerts on the robot's handle perpendicular to the direction of motion. Therefore, the hand will travel near a straight line to the target. However, the force that the subject produces perpendicular to the direction of motion is precisely the force that the CNS predicts to be required for compensation of the “expected” field. In this way, Scheidt et al. (2000) introduced an elegant way of measuring the output of the internal model.
In each trial, we measured the force temporal profile that the subjects exerted on the handle and linearly regressed that force to the ideal compensation force (which was equal and opposite force to the external perturbation force). For example, if the subject produced the perfect compensation force, these two forces will be identical and the coefficient (the slope) of the linear regression would be one.
3.6.3 Experimental results
Figure 8 shows the coefficient of this linear regression as a function of trial number. To compare subjects' learning performance in the two different fields, we computed a performance index by averaging the linear regression coefficients across trial numbers and subtracted the performance index of the acceleration dependent field from that of the velocity dependent field. The mean difference of the performance indices was 0.15±0.08 (mean ± one standard deviation), suggesting that the subjects learned the velocity dependent field significantly better than the acceleration dependent field (one sided t-test, t=4.72, d.f.=5, p=0.0026). We also measured the time constant of the time series of the linear regression coefficient for each field using an exponential fit: time constant of velocity dependent field, τ = 49.4 ± 7.5 , 95% confidence interval, r2 = 0.63; time constant of acceleration dependent field, τ = 71.8 ± 16.8 , r2 = 0.53. The significantly slower adaptation rate in the acceleration dependent field is consistent with the poor coding of this signal among the spindle-like bases.
4 Discussion
We hypothesized that the neural encoding of the internal model of arm dynamics in the brain might be via elements that represent state of the limb with functions that are similar to the coding found in the discharge of muscle spindle afferents. Two ideas motivated this hypothesis: 1) reflex based error-feedback from the spinal cord might act as a teaching signal to an internal model in the brain (Kawato, 1989; Thoroughman and Shadmehr, 1999). In this situation, the states in which errors are experienced are reported to the CNS in proprioceptive coordinates; and 2) the desired sensory state in a sensorimotor transformation may be represented in the CNS in terms of the response that the sensory system transmits when the desired state is actually attained. To test the hypothesis, we formed a basis set in which the response of each basis resembled the response of muscle spindles in whole arm movements. Some of the basis functions showed gain-field like tuning properties, i.e. nonlinear combinations of velocity and position coding. We found that the spindle-like representation could reproduce behavioral results from adaptation studies in velocity and position dependent force fields. Because the spindle-like bases were affected by limb acceleration but this coding was not as significant a factor as the sensitivity to limb velocity, the model predicted and we confirmed poorer ability to learn an acceleration dependent field.
There are a number of short-comings in our model. The spindle model that we used (Hasan, 1983) assumed a constant level of γ motoneuron activity during a movement. This is an oversimplification, of course, yet it is quite remarkable how well the model reproduced the activity of spindle afferents during active movements. For example, Prochazka and Gorassini (1998a) estimated that during free walking of a cat at various speeds, the current model could account for roughly 85% of the variance in the temporal discharge. Neural recordings from human wrist muscle spindle afferents show very similar discharge patterns to that shown in Fig. 1 (Jones et al., 2001). However, the model's ability to predict spindle discharge is not universal. Prochazka and Gorassini (1998b) tested a similar model on the triceps surae muscle of the freely moving cat and found that in case of this muscle, amount of triceps EMG activity that mimicked γ motor action linked to muscle activation did affect the fit of the model, reducing the r-squares by about half (from 0.85 to 0.43). This suggests that the model will improve if it takes into account at least some of the temporal dynamics of γ fusimotor action linked to extrafusal muscle activity (Prochazka and Gorassini, 1998b). Thus, the consideration of γ fusimotor action might change the tuning property and pattern of generalization in some cases.
Spindle afferents are not the only proprioceptive sensors that provide signals regarding muscle states. For example, Golgi tendon organs (GTO) are very sensitive to muscle tension (Gregory et al., 2002; Houk and Simon, 1967). If the desired sensory consequences of limb movement are represented centrally by a neural code that resembles the responses of peripheral sensors, then the representation will include GTO-like bases as well as spindle-like bases. This enriches the bases set to include sensitivity to not only kinematic variables like position and velocity, but also force. For example, a recent report used bias force during reaching as a cue to distinguish between two velocity dependent fields (Kurtzer et al., 2005). The subjects learned the task. The basis set used in the current report cannot learn such a field, but adding GTO-like bases would likely add this ability to the model, and would predict patterns of generalization that can be easily tested.
We used a simple linear conversion from a whole arm movement to a muscle length using a constant moment arm. However, anatomical studies show that the moment arm is not constant and varies as a function of arm orientation (Murray et al., 1995). Furthermore, we distributed the preferred joint displacement uniformly across all directions, but the distribution of preferred directions of cells in the motor cortex depends on arm configuration and is often non-uniform (Scott and Kalaska, 1997). We think that a realistic biomechanical model of the arm could give rise to a spindle-like set of bases that could account for both the behavioral patterns of generalization observed in people and the distribution of preferred directions observed in the motor cortex.
Despite these simplifying assumptions of the spindle model, the idea that adaptation of reaching is largely due to bases that encode limb state in terms of proprioception agrees with much of the current data on behavioral patterns of adaptation. However, there is an important set of data that the model fails to account for. After right-handed individuals train in a force field with their right hand, they exhibit generalization to their left hand (Criscimagna-Hemminger et al., 2003). Of course, spindle-like bases cannot generalize across arms. Interestingly, this generalization is in the extrinsic coordinates of the task. The generalization across arms vanishes when the force field is presented gradually (Malfait and Ostry, 2004), but the gradual presentation does not affect generalization for the trained arm across the workspace (as in Fig. 5). These findings suggest that the generalization across arms may be due to visual cues that arise when large errors are present. Recent results from our lab are consistent with this idea, suggesting that visual cues can lead to awareness of the force patterns (Hwang et al., 2003). Therefore, we believe that the spindle-like bases are a good model of an implicit component of motor learning but cannot account for all aspects of performance. In summary, we report that spindle-like representation of desired limb states in the internal model of arm dynamics appears computationally and physiologically plausible.
Acknowledgements
This work was supported by grants from the NIH (NS46033, NS37422). We thank John Kalaska and Steve Wise for very helpful discussions.
References
- Atkeson CG. Learning arm kinematics and dynamics. Annu Rev Neurosci. 1989;12:157–183. doi: 10.1146/annurev.ne.12.030189.001105. [DOI] [PubMed] [Google Scholar]
- Bhushan N, Shadmehr R. Computational architecture of human adaptive control during learning of reaching movements in force fields. Biol Cybern. 1999;81:39–60. doi: 10.1007/s004220050543. [DOI] [PubMed] [Google Scholar]
- Conditt MA, Gandolfo F, Mussa-Ivaldi FA. The motor system does not learn the dynamics of the arm by rote memorization of past experience. J Neurophysiol. 1997;78:554–560. doi: 10.1152/jn.1997.78.1.554. [DOI] [PubMed] [Google Scholar]
- Criscimagna-Hemminger SE, Donchin O, Gazzaniga MS, Shadmehr R. Learned dynamics of reaching movements generalize from dominant to non-dominant arm. J Neurophysiol. 2003;89:168–176. doi: 10.1152/jn.00622.2002. [DOI] [PubMed] [Google Scholar]
- Donchin O, Francis JT, Shadmehr R. Quantifying generalization from trial-by-trial behavior of adaptive systems that learn with basis functions: theory and experiments in human motor control. J Neurosci. 2003;23:9032–9045. doi: 10.1523/JNEUROSCI.23-27-09032.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flash T, Hogan N. The coordination of arm movements: an experimentally confirmed mathematical model. J Neurosci. 1985;5:1688–1703. doi: 10.1523/JNEUROSCI.05-07-01688.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gregory JE, Brockett CL, Morgan DL, Whitehead NP, Proske U. Effect of eccentric muscle contractions on Golgi tendon organ responses to passive and active tension in the cat. J Physiol. 2002;538:209–218. doi: 10.1113/jphysiol.2001.012785. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hasan Z. A model of spindle afferent response to muscle stretch. J Neurophysiol. 1983;48:989–1006. doi: 10.1152/jn.1983.49.4.989. [DOI] [PubMed] [Google Scholar]
- Houk J, Simon W. Responses of Golgi tendon organs to forces applied to muscle tendon. J Neurophysiol. 1967;30:1466–1481. doi: 10.1152/jn.1967.30.6.1466. [DOI] [PubMed] [Google Scholar]
- Houk JC, Rymer WZ, Crago PE. Dependence of dynamic response of spindle receptors on muscle length and velocity. J Neurophysiol. 1981;46:143–166. doi: 10.1152/jn.1981.46.1.143. [DOI] [PubMed] [Google Scholar]
- Hwang E, Smith MA, Shadmehr R. Soc. for Neurosci. Meeting. New Orleans, LA: 2003. Understanding the roles of proprioception and vision in adapting to arm dynamics [822.17] [Google Scholar]
- Hwang EJ, Donchin O, Smith MA, Shadmehr R. A gain-field encoding of limb position and velocity in the internal model of arm dynamics. PLoS Biology. 2003;1:209–220. doi: 10.1371/journal.pbio.0000025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jones KE, Wessberg J, Vallbo AB. Directional tuning of human forearm muscle afferents during voluntary wrist movements. J Physiol. 2001;536:635–647. doi: 10.1111/j.1469-7793.2001.0635c.xd. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kawato M. Adaptation and learning in control of voluntary movement by the central nervous system. Advanced Robotics. 1989;3:229–249. [Google Scholar]
- Krakauer JW, Ghilardi MF, Ghez C. Independent learning of internal models for kinematic and dynamic control of reaching. Nat Neurosci. 1999;2:1026–1031. doi: 10.1038/14826. [DOI] [PubMed] [Google Scholar]
- Kurtzer I, Dizio PA, Lackner JR. Adaptation to a novel multi-force environment. Exp Brain Res. 2005 doi: 10.1007/s00221-005-2216-y. [DOI] [PubMed] [Google Scholar]
- Lackner JR, Dizio P. Rapid adaptation to coriolis force perturbations of arm trajectory. J Neurophysiol. 1994;72:299–313. doi: 10.1152/jn.1994.72.1.299. [DOI] [PubMed] [Google Scholar]
- Li CSR, Padoa-Schioppa C, Bizzi E. Neuronal correlates of motor performance and motor learning in the primary motor cortex of monkeys adapting to an external force field. Neuron. 2001;30:593–607. doi: 10.1016/s0896-6273(01)00301-4. [DOI] [PubMed] [Google Scholar]
- Lin CK, Crago PE. Structural model of the muscle spindle. Ann Biomed Eng. 2002;30:68–83. doi: 10.1114/1.1433488. [DOI] [PubMed] [Google Scholar]
- Malfait N, Ostry DJ. Is interlimb transfer of force-field adaptation a cognitive response to the sudden introduction of load? J Neurosci. 2004;24:8084–8089. doi: 10.1523/JNEUROSCI.1742-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malfait N, Shiller DM, Ostry DJ. Transfer of motor learning across arm configurations. J Neurosci. 2002;22:9656–9660. doi: 10.1523/JNEUROSCI.22-22-09656.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marsden CD, Merton PA, Morton HB. Servo action in the human thumb. J Physiol. 1976;257:1–44. doi: 10.1113/jphysiol.1976.sp011354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morasso P. Spatial control of arm movements. Biol Cybern. 1981;42:223–227. doi: 10.1007/BF00236911. [DOI] [PubMed] [Google Scholar]
- Murray WM, Delp SL, Buchanan TS. Variation of muscle moment arms with elbow and forearm position. J Biomech. 1995;28:513–525. doi: 10.1016/0021-9290(94)00114-j. [DOI] [PubMed] [Google Scholar]
- Paninski L, Shoham S, Fellows MR, Hatsopoulos NG, Donoghue JP. Superlinear population encoding of dynamic hand trajectory in primary motor cortex. J Neurosci. 2004;24:8551–8561. doi: 10.1523/JNEUROSCI.0919-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poppele RE, Boyd LA. Quantitative description of linear behavior of mammalian muscle spindles. J Neurophysiol. 1970;33:59–72. doi: 10.1152/jn.1970.33.1.59. [DOI] [PubMed] [Google Scholar]
- Prochazka A, Gorassini M. Ensemble firing of muscle afferents recorded during normal locomotion in cats. J Neurophysiol. 1998;507:293–304. doi: 10.1111/j.1469-7793.1998.293bu.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prochazka A, Gorassini M. Models of ensemble firing of muscle spindle afferents recorded during normal locomotion in cats. J Neurophysiol. 1998;507:277–291. doi: 10.1111/j.1469-7793.1998.277bu.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaafsma A, Otten E, Van Willigen JD. A muscle spindle model for primary afferent firing based on a simulation of intrafusal mechanical events. J Neurophysiol. 1991;65:1297–1312. doi: 10.1152/jn.1991.65.6.1297. [DOI] [PubMed] [Google Scholar]
- Scheidt RA, Reinkensmeyer DJ, Conditt MA, Rymer WZ, Mussa-Ivaldi FA. Persistence of motor adaptation during constrained, multi-joint, arm movements. J Neurophysiol. 2000;84:853–862. doi: 10.1152/jn.2000.84.2.853. [DOI] [PubMed] [Google Scholar]
- Scott SH, Kalaska JF. Reaching movements with similar hand paths but different arm orientation: I. Activity of individual cells in motor cortex. J Neurophysiol. 1997;77:826–852. doi: 10.1152/jn.1997.77.2.826. [DOI] [PubMed] [Google Scholar]
- Shadmehr R, Mussa-Ivaldi FA. Adaptive representation of dynamics during learning of a motor task. J Neurosci. 1994;14:3208–3224. doi: 10.1523/JNEUROSCI.14-05-03208.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shadmehr R, Moussavi ZMK. Spatial generalization from learning dynamics of reaching movements. J Neurosci. 2000;20:7807–7815. doi: 10.1523/JNEUROSCI.20-20-07807.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thoroughman KA, Shadmehr R. Electromyographic correlates of learning internal models of reaching movements. J Neurosci. 1999;19:8573–8588. doi: 10.1523/JNEUROSCI.19-19-08573.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thoroughman KA, Shadmehr R. Learning of action through adaptive combination of motor primitives. Nature. 2000;407:742–747. doi: 10.1038/35037588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wise SP, Tanji J. Neuronal responses in sensorimotor cortex to ramp displacements and maintained positions imposed on hindlimb of the unanesthetized monkey. J Neurophysiol. 1981;45:482–500. doi: 10.1152/jn.1981.45.3.482. [DOI] [PubMed] [Google Scholar]