Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2022 Jun 9;18(6):e1010192. doi: 10.1371/journal.pcbi.1010192

Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllers

Justinas Česonis 1, David W Franklin 1,2,3,*
Editor: Adrian M Haith4
PMCID: PMC9217135  PMID: 35679316

Abstract

The separation of distinct motor memories by contextual cues is a well known and well studied phenomenon of feedforward human motor control. However, there is no clear evidence of such context-induced separation in feedback control. Here we test both experimentally and computationally if context-dependent switching of feedback controllers is possible in the human motor system. Specifically, we probe visuomotor feedback responses of our human participants in two different tasks—stop and hit—and under two different schedules. The first, blocked schedule, is used to measure the behaviour of stop and hit controllers in isolation, showing that it can only be described by two independent controllers with two different sets of control gains. The second, mixed schedule, is then used to compare how such behaviour evolves when participants regularly switch from one task to the other. Our results support our hypothesis that there is contextual switching of feedback controllers, further extending the accumulating evidence of shared features between feedforward and feedback control.

Author summary

Extensive evidence has demonstrated that humans can learn distinct motor memories (i.e. independent feedforward controllers) using contextual cues. However, there is little evidence that such contextual cues produce similar separation of feedback controllers. As accumulating evidence highlights the connection between feedforward and feedback control, we propose that context may be used to separate feedback controllers as well. It has not been trivial to test experimentally whether a change in context also modulates the feedback control, as the controller output is affected by other non-contextual factors such as movement kinematics, time-to-target or the properties of the perturbation used to probe the control. Here we present a computational approach based on normative modelling where we separate the effects of the context from other non-contextual effects on the visuomotor feedback system. We then show experimentally that task context independently modulates the feedback control in a particular manner that can be reliably predicted using optimal feedback control.

Introduction

Whether it is touching a hot surface, returning a tennis serve or simply lifting an object, the human body utilises a variety of sensory inputs to produce movements of any complexity. Indeed, different feedback modalities of human motor control, such as stretch reflex [13], vestibulo-ocular reflex [4, 5], visuomotor [614], or even auditory feedback [15, 16] have extensively been studied in prior literature. However, most studies have investigated feedback control in paradigms of either a single task [1721], or multiple tasks presented in their own dedicated blocks [2226]. While such designs provide key insights into the behaviour of the feedback controller in isolation, they are not entirely reflective of human behaviour in real-life situations. For example, a realistic sequence of events could require a volleyball player to first pick up the ball from the ground by reaching for it with their hand and stopping on contact, only then to hit the same ball with the same hand a few moments later while serving. While studying both components independently has received focus in the field of motor control, any interactions between the feedback controllers in the context of rapid switching have not been broadly studied.

While feedback control in human movement is critical in correcting for random errors within movements, feedforward control corrects for movement errors that are predictable. In order to systematically predict and compensate for specific errors upcoming in a given movement, the mechanism of contextual switching via contextual cues is broadly accepted. It is now well understood that performing two opposing tasks in an alternating manner will lead to interference [2729], resulting in behaviour that is averaged between the two tasks, failing to deal with either task. However, if the two tasks are performed in sufficiently different contexts, such as separate physical or visual workspaces [3032], or different lead-in [33, 34] or follow-through movements [35, 36], this interference can be reduced, allowing the formation of two separate motor memories. Hence, it is reasonable to expect, that a similar contextual regulation could be present in feedback controllers. Therefore, in this study we test whether the feedback control policies exhibit such modulation when humans are presented with different tasks in an alternating manner.

One difference between studying contextual switching in feedforward and feedback control is that it is difficult to evaluate whether the feedback control policy has changed after the intervention. Specifically, it has been shown computationally that the optimal feedback controller (OFC) with fixed parameters can produce variable responses when correcting for perturbations within the movement, for example, when the comparable perturbations are induced in different parts (e.g. early or late) of otherwise identical movements [26, 3739]. Furthermore, such behaviour was also observed in experimental studies [7, 20, 26, 3941]. Hence, merely observing a difference in the feedback response is not enough to conclude a change in the control policy. However, recently we demonstrated that as long as two perturbations of the same magnitude are induced at the same time-to-target (which is defined as a difference between the perturbation onset and movement end), the same feedback control policy produces the same magnitude response, independent of whether the two perturbations occurred at the same location, time from the beginning of the movement, or the movement velocity [26]. Thus, we can utilise this relationship between the magnitude (or intensity) of the feedback response to a perturbation at the same time-to-target to quantify whether the difference in the response is due to the change in the control policy or not.

There are several studies that have already looked into contextual regulation in feedback control tasks. Most of such studies, to our knowledge, approached this question by modulating the structure of a target (wide vs. narrow, long vs short, etc.) [4245], or by including obstacles along the reaching path [42, 45]. Results of these studies are consistent with optimal control-like behaviour with separate controllers for different tasks, even when the target structure is changed on random trials [42, 44, 45], however one study suggests that control when the target is unpredictable may be sub-optimal [43]. In this study we test whether such rapid switching also holds true for tasks, where target structure remains unchanged, but the tasks themselves require a change of movement properties. Specifically, we test how the feedback control policies are affected when our participants are presented with a “multitasking” scenario where they have to switch between performing two distinct tasks, i.e. reaching to and stopping at the target, or hitting through the target and stopping behind it. While the two tasks are fundamentally different, and in isolation should require different feedback control policies, here we also test whether the same relationship holds true in the mixed schedule (as it would for contextual switching in feedforward control), or if the interference between two control policies results in a single policy, averaged or weighted between the two independent controllers.

Results

In this study we tested the behaviour of the human feedback controller when switching between two different tasks. Specifically, we presented our human participants with two tasks requiring different control policies—a stop task, where participants had to reach and stop at the target, and a hitting task, where participants had to punch through the target and stop behind it. In our previous work we demonstrated computationally that these two different types of movements trigger feedback responses of different magnitudes, even if the perturbations occur at the same position, time, or time-to-target [26]. However, if the two movements share the same goal (for example the goal of stopping at the same target), then these feedback responses match in magnitude if the time-to-target matches in both movements, irrespective of other movement parameters like peak velocity, movement distance, distance to the target or current velocity. Therefore, such a relation between time-to-target and feedback response intensity could be used to characterise the feedback control policy.

We use the relationship between the time-to-target and the feedback response intensity (which serves as a proxy for feedback controller gain) as a means to analyse the controller behaviour when the task changes. Specifically, we propose two alternatives for the architecture of such control: a single universal feedback controller that exhibits adaptation to a given task (Fig 1A), or multiple task-specific controllers, gated by task context (Fig 1B). When presented with a single task in a blocked schedule (e.g. blocked stop or blocked hit), both the universal controller and task specific controllers are expected to behave similarly, as the universal controller should easily adapt its gains appropriately for the required task. However, if multiple tasks are presented in a mixed schedule (i.e. task can randomly switch from trial to trial), the different control architectures predict different responses. Particularly, a single universal controller would aim to adapt to each presented task, thus on average producing responses somewhere in between the two given tasks within the mixed schedule (Fig 1C). In contrast, a set of task-specific controllers would produce similar responses in the mixed schedule as they would in a blocked schedule, as for every trial an appropriate controller is selected from a set of controllers, rather than being adapted for the task (Fig 1D).

Fig 1. Theoretical predictions of two different architectures for feedback regulation.

Fig 1

A. Universal feedback controller. A single feedback controller is used to produce both stop and hit movements, and is adapted to the given task over multiple trials. Such adaptive behaviour is reminiscent of the behaviour of the feedforward controller when learning two opposing force-fields without separable context. Cn indicates a feedback controller at trial n B. A feedback controller as a set of task-specific controllers. A task-specific controller (stop or hit) is selected based on the task-related context and is used during the given movement. Such contextual switching behaviour is reminiscent of the behaviour of the feedforward controller when learning two opposing force fields with separable context. C. Expected regulation of feedback responses by the universal feedback controller. When exposed to a single task for a long time (blocked schedule) the controller adapts to the given task, producing optimal responses for both stop and hit conditions. However, due to interference within the mixed schedule, such a controller would settle to the average (or weighted average) gains between the two blocked conditions. D. Expected regulation of feedback responses by a set of task specific controllers. Within the blocked schedule, similar regulation is expected between hit and stop as in the case of the universal controller (C). However, in the mixed schedule, due to the task-related context, an appropriate controller is recalled on a trial-to-trial basis, producing similar regulation as within the blocked schedule.

In order to probe the control policies of human participants within these different tasks, we occasionally perturbed participants during the movement by visually shifting the target perpendicular to movement direction and inducing a reactive visuomotor feedback response (Fig 2A). Recently it has become common practice to maintain these perturbations until the end of the movement, such that an active correction is required to successfully complete the trial [19, 26, 41, 4651]. However, we have noticed in our previous work that such maintained perturbations significantly impact the overall time-to-target, which in turn affects the visuomotor feedback gains [26]. Thus, to keep the measurements of visuomotor feedback responses consistent within time-to-target, in this study we only perturbed our participants laterally in channel trials [7, 40, 52] and maintained these perturbations for 250 ms before switching them off, making any corrections redundant. As a result, even when producing the feedback response, participants’ hands are constrained along the path of forward movement, resulting in matching movement durations independent of different perturbation onsets.

Fig 2. Experimental perturbations and responses.

Fig 2

A. Perturbations in stop (left) and hit (right) conditions. Participants performed a forward reaching movement towards a target, positioned 25.0 cm in front of the start position. When the hand crossed one of five evenly spaced locations (dashed lines), a perturbation could be induced by shifting the target by 2 cm laterally for 250 ms and then returned back to the original position. Participants were instructed to either stop at the blue target (stop condition), or hit the red target and stop within the blue rectangle (hit condition). B. Net unscaled feedback responses to the target perturbations in the stop condition, measured via the force channel. Participants produced corrective responses to the target perturbations that varied by different perturbation onsets. Different traces represent different perturbation onsets, with darker colours indicating earlier perturbations. Shaded areas represent one standard error of the mean (SEM). The grey rectangle represents the time window of 180–230 ms, where the visuomotor feedback intensities are measured. C. Net unscaled feedback responses to the target perturbations in the hit condition.

Participants produced involuntary feedback responses to the target jumps. These responses, observed as a lateral force exerted by the participants on the handle of the robotic manipulandum, were modulated by the different perturbation onsets (Fig 2B and 2C). From these force responses we computed feedback intensities, by averaging individual responses over a time window 180 ms–230 ms relative to the perturbation onset on each individual trial. This time window has now been used in numerous studies to quantify such responses and is associated with the involuntary, early visuomotor responses [7, 25, 40, 50, 51].

OFC model predicts differences between hit and stop conditions

We utilised the mixed-horizon OFC [38] model, presented in our earlier work, to generate predictions of feedback control policies in our current study. Due to the experimental design of this study not requiring an extension in movement times after perturbations, the predictions of the mixed-horizon model also matched the predictions of our earlier time-to-target OFC model [26]. In order to compare differences in control throughout hit and stop movements, we first simulated two movement conditions: a 25 cm long movement with 60 cm/s peak velocity and velocity at the target distance <1 cm/s (stop condition), and a similar movement, but with velocity at the target >20 cm/s (hit condition) (Fig 3A). Both models were implemented using a linear quadratic regulator (LQR), and were identical, apart from the difference in state-dependent costs of terminal velocity and terminal force. Here we reduced these cost parameters for the hit model by a factor of 50 in order to reduce the incentive to stop at the target, and thus successfully simulate hit-like movements. In addition, we also simulated a third condition, that we term the long-stop condition, where we used the same position, velocity, force and mean activation costs as in the stop model, but applied for reaching movements of 28 cm. The concept of the long-stop model is to compare the actual hit behaviour, executed through a different controller, with “cheating” behaviour where the same stop movement is performed to an imaginary target, located beyond the actual target, resulting in non-zero velocity at the actual target, and thus appearing as a hit movement. For all three conditions we then induced virtual target perturbations by shifting a target laterally by 2 cm at every time step from movement onset to movement end. With these simulations we obtained one continuous feedback response profile per condition, showing a dependency of feedback response intensity on time-to-target (Fig 3B and 3C). This feedback response profile is characteristic of the particular movement control policy associated with the movement goal, as it is maintained even if the kinematics of the movement change (Figure 8 in [26]).

Fig 3. Comparison of OFC model predictions and experimental results.

Fig 3

A. Simulated kinematics of stop, hit and long-stop conditions. Stop and hit conditions produce matching kinematics that only deviate shortly before movement end. The long-stop condition is a control simulation, that matched the kinematics of the hit condition for the duration of the hit movement, but was achieved with the same stop controller. B. Simulated feedback intensities as a function of time-to-target and C. time-to-movement-end. Simulations predict a faster increase of response intensities for hit condition than for stop condition. As the long-stop condition is simulated via a longer (28 cm) movement, the time-to-target represents a time until the simulated movement crosses a point of 25 cm distance (the target distance). For hit and stop conditions, time-to-target and time-to-movement-end are identical. When expressed against time-to-movement-end, long-stop produces matching responses to the stop condition, as the feedback controller used for these movements is identical. With respect to the time-to-target, long-stop responses are time-shifted from the stop responses. D. Simulated feedback intensities as a function of the position. Stop and hit simulations with these particular kinematics produce matching feedback intensity profiles when expressed against position, even if the feedback controllers are different. In contrast, the long-stop simulation with a feedback controller matching that of the stop condition still produces different intensity profile against position. Shaded areas in simulated traces represent 95% confidence intervals for simulated results. E. Velocity profiles of participants in blocked stop and blocked hit conditions. The profiles match the task requirements. F. Normalised feedback intensity profiles of participants in blocked stop and blocked hit conditions, expressed against time-to-target. Participants produce stronger responses at matching time-to-target in the hit condition, consistent with simulation results for hit and stop. G. Normalised feedback intensity profiles of participants in blocked stop and blocked hit conditions, expressed against position. Participants produce matching responses within hit and stop conditions, supporting model simulations for stop and hit conditions, and not stop and long-stop. Error bars in experimental results represent 95% confidence intervals.

Even with similar simulated kinematics, that deviate from each other only in the last portion of the movement, the OFC model predicts striking differences in the control policies for stop and hit conditions (Fig 3B and 3C, blue and red traces) or hit and long-stop conditions (Fig 3B and 3C, red and green traces) when expressed against time-to-target. On the other hand, when expressed against position, even different controllers (hit vs stop) show no differences in feedback intensities, while identical control (stop vs long-stop) exhibit clear differences (Fig 3D). Among other things, these results point out limitations of position as a dependent variable in determining the changes of control policies, and provide yet additional support for time-to-target.

Our models make a few predictions for the behaviour of human participants. First, it challenges the classic assumption that visuomotor feedback response profiles are always bell shaped, if probed at evenly spaced locations or movement times. Instead, we propose that the bell-shaped feedback response profiles are consequential to the specific kinematic values imposed by the experiments, and other, for example monotonically decreasing intensity profiles, are also possible with faster movements (Fig 3D). Second, our simulations also make predictions on relative differences between the feedback intensity profiles in stop and hit conditions. Particularly, we expect the hit condition to produce stronger responses than the stop condition for short times-to-target, with this relationship inverting for long times-to-target if the two types of movements require different feedback controllers (Fig 3B and 3C). Note, that while in previous studies it is typical to compare such response profiles in terms of perturbation onset location, here no difference between hit and stop is predicted in position-dependent profiles (Fig 3D).

Human control policies match model predictions in hit and stop conditions

In order to compare the behaviour of our participants to the model predictions, we first analysed our results from the blocked schedule of the experiment. Here every participant has completed a block of 416 trials of hit condition and another block of 416 trials of stop condition, with the order counterbalanced across all participants. Our experimental results qualitatively match the predictions of our model. First, participants successfully differentiated between the kinematics of the hit and the stop condition, with both types of movements resulting in matching early and peak velocity (vpeak,stop = 58.9 cm/s, vpeak,hit = 58.1 cm/s), but with differences towards the end of the movement such that the velocity at the target is higher for the hit condition (Fig 3E). Specifically, in the hit condition participants produced movements with average velocity at the target of 38.5 cm/s, while successfully stopping at the target in the stop condition. In addition, similar to the model simulations, movements in the hit condition were of slightly shorter duration (630 ms vs 700 ms).

Qualitatively, the experimental feedback responses also match the model predictions (Fig 3F and 3G). First, due to relatively fast reaching velocities in our experiment, as well as the lack of maintained perturbations, all perturbations were induced at short times-to-target (under 550 ms). For comparison, in our previous study [26] perturbations were induced at times-to-target that ranged between 300 ms and 1000 ms, with peak feedback intensities recorded for perturbations with time-to-target at 400 ms. Second, both our data and the model produce feedback intensities at short times-to-target that are higher for the hit condition than for the stop condition, even in movement segments where the kinematics are otherwise similar. In addition, we also observe no learning effects within this regulation, as the relative behaviour across conditions is present in the first few blocks of the study, and remains throughout the entire experiment (S1 Text). Importantly, we do not fit the model to match the data, but instead use it to qualitatively describe the relative regulation of stop and hit conditions. As such, matching features between the intensity profiles of the model (Fig 3B–3D) and the data (Fig 3F and 3G) imply that similar computational mechanisms may be in action. Finally, our results also indicate that participants utilise different feedback controllers for the hit and stop conditions, as the experimental results for the blocked hit condition match the model simulations of the hit, rather than the long-stop condition.

Human participants utilise contextual switching of feedback controllers

In the previous sections we established the differences between the baseline control policies of hit and stop conditions. Here, we test how these policies change when the exposure to these conditions is no longer blocked. For example, it is natural in our daily activities to continuously switch between tasks, rather than doing a single task for many repetitions before switching to a new task. However, the question remains, how switching between different tasks affects the underlying feedback control policies. To test this, in the second half of the experiment we presented our participants with the same two types of movements (hit and stop), but now with the conditions randomly mixed across trials, instead of being presented in two separate blocks. As such, we could test for one of two possible outcomes:

  1. Control policies for stop and hit movements in the mixed schedule match respectively the control policies in the stop and hit movements in the blocked schedule (Fig 1D). Such an outcome would indicate that participants are able to easily switch between different control policies (at least within consecutive trials).

  2. Control policies for stop and hit movements in the mixed schedule do not match with the respective baseline policies, indicating interference when switching among multiple conditions (Fig 1C).

While both outcomes have previously been discussed from the sensorimotor adaptation perspective, to our knowledge they have not yet been demonstrated for feedback control.

Our participants successfully produced the movements required in the experiment (Fig 4A). Particularly, we observed clear distinctions in the terminal velocity between the hit and stop conditions, independent of the experimental schedule (blocked or mixed). A two-way repeated-measures ANOVA showed a significant main effect on condition (hit or stop, F1,13 = 544.2, p ≪ 0.001), but no significant main effect on experiment schedule (blocked or mixed, F1,13 = 0.710, p = 0.42) or schedule/condition interactions (F1,13 = 0.681, p = 0.42). In addition, a complementary Bayesian repeated-measures ANOVA analysis showed similar results, with a very strong effect [53] of condition (hit or stop, BFincl = 1.6 × 1025), and with a tendency towards no effect of schedule (blocked or mixed, BFincl = 0.379), or condition/schedule interaction (BFincl = 0.409). A similar analysis for peak velocities showed a significant main effect of condition (hit or stop, F1,13 = 5.94, p = 0.03; although BFincl = 1.12 indicates not enough evidence to either reject or accept the null hypothesis) and condition/schedule interaction (F1,13 = 19.3, p ≪ 0.001; BFincl = 32.6), but not on schedule (blocked or mixed, F1,13 = 1.52, p = 0.24; BFincl = 0.56 shows a weak tendency towards accepting null hypothesis). The Holm-Bonferroni corrected post-hoc analysis for the interaction term revealed that participants produced slightly faster movements in the mixed-hit condition, with the peak velocities matching otherwise.

Fig 4. Experimental results of stop and hit conditions in both blocked and mixed schedules.

Fig 4

A. Velocity profiles against position. Both stop conditions and both hit conditions produce respectively similar velocity profiles, showing that participants successfully performed the task in the mixed schedule. B. Normalised feedback response intensities represented as a function of time-to-target. Hit and stop movements in the mixed schedule demonstrate differences when expressed against time-to-target, that match the differences between hit and stop conditions in the blocked schedule. This supports the hypothesis of contextual controller switching between multiple task-specific controllers. C. Normalised feedback intensities in all four conditions show no differences when expressed against position or D. movement time at perturbation onset, as predicted by the OFC simulations. This questions the appropriateness of position or movement time as the reference frames in which to compare multiple feedback controllers. Error bars and shaded areas indicate 95% confidence intervals of the mean.

We examined the evolution of the experimental visuomotor responses as a function of perturbation onset position or onset time across the four different conditions (Fig 4C and 4D). When expressed against either position or time, the visuomotor intensity profiles do not show the classical bell-shaped profile where strongest responses occur in the middle of the movement and are reduced towards the beginning and end. Instead, our participants produced the strongest responses for the earliest perturbations, induced at 1/6 of the total forward movement, with further responses decaying in intensity as perturbations occurred closer to the target. Moreover, we observed no significant differences in visuomotor responses across the different conditions and schedules. Three-way repeated-measures ANOVA with condition (stop or hit), schedule (blocked or mixed) and perturbation location (5 levels) as main factors showed no effect of condition (F1,13 = 0.486, p = 0.50; BFincl = 0.238 shows substantial evidence towards no effect), schedule (F1,13 = 0.096, p = 0.76; BFincl = 0.142 shows substantial evidence towards no effect) or condition/schedule interaction (F1,13 = 0.657, p = 0.43; BFincl = 0.305 shows substantial evidence towards no effect). While we observed a significant main effect of the perturbation location (F2.9,37.7 = 61.2, p ≪ 0.001 after Greenhouse-Geisser sphericity correction; BFincl = 9.3 × 1036), such an effect was expected due to the temporal evolution of feedback responses. In addition, we observed a significant interaction between perturbation onset location and the condition (F2.1,27.0 = 6.26, p = 0.005 after the sphericity correction; BFincl = 6.86), however a Holm-Bonferroni corrected post-hoc analysis on the interaction term did not indicate any meaningful interaction effects, with none of the significant interactions appearing at the same perturbation onset location. Finally, the remaining interactions of schedule/perturbation (F2.6,33.9 = 2.67, p = 0.07 after Greenhouse-Geisser sphericity correction; BFincl = 0.289) and condition/schedule/perturbation (F2.8,36.8 = 0.233, p = 0.86 after Greenhouse-Geisser sphericity correction; BFincl = 0.075) showed no significant effects. Thus, as a whole our analysis indicates that the feedback controllers could not be differentiated when expressed as a function of the position within the movement.

When expressed against time-to-target, the visuomotor feedback responses show decreasing feedback intensities with decreasing time-to-target, with responses virtually vanishing when the time-to-target approaches zero (Fig 4B). This behaviour is consistent with our previous models describing the time-gain relationship [26]. In addition, we observe stronger increases in visuomotor feedback intensity with increasing time-to-target for the hit condition compared to the stop condition, in both blocked and mixed schedules. Such regulation was previously predicted by our time-to-target OFC model (see Figure 9C in [26]) for short times-to-target. Finally, we also observe a qualitative match between the two stop conditions (mixed and blocked) as well as between the two hit conditions (mixed and blocked), suggesting first evidence of rapid feedback controller switching in the mixed schedule. This finding holds equally well in trials immediately after a condition switch, as well as after the trials of the same movement condition (S2 Text). Qualitatively the increase of visuomotor response intensities with time-to-target for our specific results could be well described by a line function for each of the four combinations of condition and schedule. In order to get a quantitative estimate of the differences between the conditions we performed a Two-way ANCOVA analysis of visuomotor response intensity, with schedule and condition as the two factors, and time-to-target as the covariate. The results showed a significant main effect of condition (hit or stop, F1,275 = 24.8, p ≪ 0.001; BFincl = 9.46 × 103), and time-to-target (F1,275 = 222.8, p ≪ 0.001; BFincl = 1.04 × 1033), but no effect of the experimental schedule (blocked or mixed, F1,275 = 0.098, p = 0.75; BFincl = 0.138) or of schedule/condition interaction (F1,275 = 1.06, p = 0.30; BFincl = 0.304 shows tendency towards no effect). Such results indicate that we can successfully separate the two different controllers when expressing their feedback response intensities (or their gains) against the time-to-target. Furthermore, we also show that such differences are only present when comparing the controllers for different tasks, and are not dependent on the presentation schedule of these tasks. Thus, we demonstrate that our participants successfully selected an appropriate controller for a hit or a stop task, even in a schedule where the task could change on consecutive trials.

Discussion

In this study we have demonstrated that humans are capable of rapid switching between appropriate feedback controllers in the presence of different contextual cues. Specifically, our participants show systematic differences in feedback responses when performing hitting movements, compared to reach-and-stop movements. Moreover, the same systematic differences are present, both when the two tasks are performed in isolation (blocked schedule), or when rapidly switching from one task to the other (mixed schedule), showing that these differences are evoked within a single trial, and not gradually adapted. Finally, these feedback responses are also well matched with the optimal feedback control predictions for these responses in hit and stop tasks, further reinforcing accumulating evidence of optimality principals in the feedback control of human movements.

In order to gain insight into computational mechanisms that are employed when humans switch between hit and stop conditions, in this study we formulate our hypothesis through normative modelling [37, 5458]. Such an approach compares the behavioural experimental data to the results simulated computationally through a known bottom-up design. In turn, any mismatch between the data and the model rules out the mechanism, while matching behaviour provides support for the likelihood of such a mechanism. Specifically, here we simulate three different types of control movements: stop movement, where a point mass is stopped at a target 25 cm away from the start position; hit movement, where the point mass is instead brought to the same target with nonzero terminal velocity; and a long-stop movement, with similar kinematics to the hit movement within the 25 cm segment, generated by a stop movement to a secondary virtual target at 28 cm distance. The hit and stop simulations differed in the implementation of the feedback controller, with the state dependent costs for the terminal velocity and terminal acceleration reduced by a factor of 50 for the hit condition. As a result, the two models inherently simulate the behaviour that is achieved via different controllers. On the other hand, the long-stop condition was simulated by using the same controller as the stop condition, but to a target at 28 cm instead of 25 cm. Consequently, such a movement still maintained a non-zero velocity at 25 cm, virtually simulating a hit-like movement. Notably, in order to better match the kinematics of a long-stop movement to the kinematics of the hit and stop movements, we temporally modulated the activation cost R of the long-stop controller, which we have previously shown does not affect the overall feedback responses in terms of time-to-target [26]. In general, while kinematics of hit and long-stop models matched well, the two simulations predicted very different feedback response profiles when expressed both against time-to-target and against position. Finally, the responses of our participants in the hit condition matched better with the model simulation of the hit condition, rather than the simulation of the long-stop, providing evidence that humans use different feedback controllers for different tasks.

Principles of contextual switching have been extensively studied in the context of feedforward adaptation [30, 31, 34, 5962]. While these cues vary in effectiveness [30, 59] and are typically considered as relative weightings of multiple feedforward models [63], strong dynamic cues such as differences in follow-through [35, 36], lead-in [33, 61], or visual workspace [31, 32] can effectively separate the feedforward models. As multiple recent papers have demonstrated that voluntary (feedforward) and feedback control likely share neural circuits [24, 39, 6467], it is reasonable to believe that similar contextual regulation would also be present in feedback control. However, studies that have shown this parallel changes in the feedback responses to the learning of the feedforward dynamics, either examined before and after adaptation to novel dynamics [24, 64, 68, 69], or during the process of adaptation [19, 7072], meaning that the they could not distinguish between the slow adaptation of the feedback controller to each condition or the rapid switching between two controllers. Moreover, other studies have suggested that feedforward and feedback controllers are learned separately [73, 74] and may even compete with one another [75], suggesting that these share different neural circuits and may have different properties. In this study we showed that in the mixed schedule, where the task goal unpredictably switched between hit or stop tasks on consecutive trials, participants evoked different control policies for each task. Furthermore, these policies, evoked within mixed schedule, well matched with the respective policies in the blocked schedule, suggesting that they were not only different from one another, but also appropriate for each task, showing the strong separation of the two contexts. While this is not unexpected, as the two hit and stop tasks are significantly different in their dynamics and thus should act as a strong contextual cue, one important result is that we demonstrated that the context regulates the feedback, and not only feedforward control. Finally, our results are also consistent with the accumulating evidence of the shared relationship between feedforward and feedback control in human reaching.

One reason why contextual effects on feedback control have not been broadly studied, is that it is difficult to quantify what really constitutes a change in feedback control policy. For example, we can trigger responses of different magnitudes by changing the size of the perturbation [41, 42], inducing perturbations at different positions [26, 40] or at different times [20, 37, 76]. However, computationally such differences in response intensity can be achieved within the same optimal feedback controller without ever changing control parameters [38]. On the other hand, experimental tasks, presented in some of these studies, e.g. reaching towards narrow, wide or long targets, inherently require different feedback controllers. Specifically, assuming a similar controller to the one we present in this work, a wide target implies a reduced ωp,t weight in x-axis compared to the narrow target, thus leading to a different optimum of control matrix L (Eqs 2 and 3). Indeed, human responses in tasks where the target structure changed (either by shape or by the presence of obstacles [42, 44, 45]) were consistent with the OFC predictions of two independent controllers [42]. In this article we present two tasks that also require different feedback controllers, but achieve that while maintaining the target shape. Instead, we invoke different controllers by modulating the task requirement of either stopping at the target, or hitting through it. In addition, by combining an OFC model predictions with our previous work, showing that the time-to-target is a strong predictor of the feedback intensity in optimal control tasks [26], we not only show that the human behaviour is consistent with two independent controllers, but also that it cannot be explained by one controller. Specifically, we simulate the behaviour either by recomputing the controller L (Fig 3B–3D, hit and stop), or by updating the state estimate x^ and using the same controller L (Fig 3B–3D, stop and long-stop), to compare with the experimental results (Fig 3F and 3G). These results show that humans indeed change their control policies when the task goal (e.g. hit or stop) changes. Thus, by combining behavioural results with normative control models we can clearly identify that it is specifically the change in control, and not other mechanisms, that is responsible for the regulation observed in the experimental data.

Previous studies have demonstrated that visuomotor feedback intensity profiles are roughly bell-shaped along the movement—low at the beginning and the end, and peaking in the middle [26, 40]—leading to assumptions that these gains might parallel the velocity [39, 67]. Our simulations and experimental results (Fig 3D and 3G) demonstrate that this bell-shape profile is not fixed, and that other profiles are possible. In our previous work, we established a robust relationship between the visuomotor feedback intensities and time-to-target, demonstrating that time-to-target is the fundamental variable that modulates the responses, given that the task goal (and thus the feedback controller) remains the same [26]. This means that the bell-shaped profile is simply a by-product of a specific timing of perturbations, and is not regulated by their onset location. As a consequence, the shape of these feedback intensity profiles can be modulated away from the bell-shaped profile by changing movement speed, target distance or acceleration profile. Such results illustrate possible caveats in the experimental paradigms of motor control: historically, some of the task requirements have been largely consistent, particularly in terms of reaching distance, reaching speed or duration. This may result in some measured behavioural outcomes being specific to these kinematics or conditions rather than representing the general features of the motor control system. Thus, while we do not advocate for routinely altering the standard experimental and analytical methods, it is worth considering the specific biases that such methods may contribute to a given study.

One popular way of looking at the visuomotor responses in humans is how they vary with position in a movement. Indeed, numerous studies either analyse the evolution of responses against position [26, 39, 40], or induce perturbations based on a fixed position [7, 9, 25, 41, 4951], with the expectation that these perturbations induce similar responses unless the control changes. For example, [9] demonstrated different feedback responses, induced at a matched position in movements towards different targets. While we believe that these different target properties indeed suggest different feedback controllers, such a distinction cannot be reliably tested with only one perturbation, matched by position. Our results clearly demonstrate the limitations of position as the main variable to probe such control. On one hand, even with similar kinematics for the majority of the movement, simulations of stop and long-stop movements predict radically different responses at matching positions (Fig 3D), despite the fact that these are generated with identical controllers. On the other hand, different controllers for hit and stop conditions still produced roughly matching feedback responses at the same position, consistent with the experimental data (Fig 3D and 3G). In contrast to position as the main variable, OFC simulations in both this study and our previous work [26] show that the same controller, when expressed against time-to-target, produces matching response profiles, independent of other kinematic factors such as movement velocity or position of the perturbation onset (Fig 3B and 3C). Furthermore, different controllers, such as hit and stop, produce feedback responses with systematic differences when expressed against time-to-target, exactly as demonstrated by our participants. Thus, we propose that time-to-target is the better reference frame for comparing feedback responses.

In this study we have raised two alternative hypotheses about the regulation of feedback controllers within the mixed schedule. The first possibility is that the feedback control gradually adapts to a given task over a few consecutive trials, similar to the feedforward control during learning of a force field or visuomotor rotation. If such adaptation was true, we expect different feedback intensities between the hit and stop conditions in the blocked schedule as the controller has enough trials to reach steady-state behaviour. However, in the mixed schedule the controller would drift between the equilibrium of hit and stop conditions, producing similar responses for mixed hit and mixed stop conditions. Note that even in such a case where only a single controller is performing both hit and stop trials, we would not necessarily expect any effects on the kinematics or the participant’s ability to complete the task. Instead, due to the feedback nature of the control, a sub-optimal controller would still complete the movement, but produce sub-optimal (e.g. more costly) responses in the presence of external disturbances. The second possibility is that an appropriate controller is selected before each movement based on the provided context, allowing immediate switching between tasks. In this case, the feedback intensity profiles would match for the same task, regardless of the schedule of their presentation. That is, we expect to see similarities between both hit conditions, as well as between both stop conditions, but differences between any two hit and stop conditions. Our experimental results strongly support the latter option, as we not only observe differences between mixed hit and mixed stop conditions, but also observe their respective match with the blocked conditions. While our results do not rule out the adaptation of feedback controllers in general, we do demonstrate that different optimal controllers can be rapidly selected and switched between for familiar tasks.

One important aspect of the relationship between feedforward and feedback control is that modulating one of them should affect the behaviour of the other. Indeed, previous work has demonstrated that human participants changed their feedback gains after adapting their feedforward models to novel dynamics [11, 19, 24, 64, 6872]. However, an adapted movement in the force field typically produces kinematics that are similar to those in baseline movements, suggesting that such change of gains is achieved at matching times-to-target, and with the same task goal. Thus, our proposed framework that the relation between feedback intensities and time-to-target is unique for a unique controller would predict that the feedback gains would remain unchanged. As a result, we can not directly explain this change of control gains, unless the feedback controller somehow changes during adaptation. One factor driving such a change is that adapted movements in the force field are more effortful than baseline movements, due to additional muscle activity required to compensate for the force. An increased effort in the context of OFC simulations would thus increase the model activation cost R, resulting in a change of optimal feedback gains and intensities at matching times-to-target. In addition, the presence of a force field likely influences the biomechanics of the movement (particularly the muscle viscosity b), changes the state transition due to the external dynamics (via state transition matrix A), and updates the state uncertainty [77], resulting in the same controller being applied to a different control plant, and thus producing different responses. Moreover, if the controller is optimised to to this new control plant, adaptation will inevitably require a new feedback controller. Therefore, such changes in feedback control are expected, even though conventionally it appears that the task goal remains the same after adaptation to the novel dynamics.

Even though many recent studies use force channel trials [52] to accurately measure the visuomotor feedback responses [7, 11], often these brief perturbation trials are complemented with maintained perturbation trials [19, 26, 41, 4648, 50, 51]. This is because brief perturbations within a channel trial are task-irrelevant, and can be ignored without compromising the task, whereas maintained perturbations strengthen these responses as they require an active correction for the participant to reach the target. However, we have recently shown that these maintained perturbations also force a non-trivial extension of the movement duration compared to the non-perturbed movement, and thus complicate the relationship between the perturbation onset location and the time-to-target. Hence, in order to consistently evaluate the control behaviour and its relation to the time-to-target, here we deliberately chose to only induce perturbations within the force channels and not to include the maintained perturbations. Although this generally decreases overall feedback intensities, our participants produced clear responses that exhibited the temporal evolution as predicted by the OFC model simulations.

Another possible limitation of using channel trials to probe the feedback control is the potential interference of the stretch reflex. Specifically, small forces produced on the hand by the channel could set on the feedback corrections [78] that superimposed onto the measured visuomotor responses. However, as the onset of the force channel occurs prior to any movement of the participant, long before the time of the visual perturbation, the channel onset will not produce a stretch reflex response timed to the visual stimuli. More importantly, such effects, as well as any other corrections to the channel onset would be present in all channel trials (including zero-perturbation trials), and thus would cancel-out in the net feedback responses, as they would not depend on the direction of the visual perturbation. As a result, it is unlikely that the force channels introduced systematic effects into our recorded visuomotor feedback signal. Similarly, we also saw no behaviour differences between simulations of free movements, that we presented in our results, and analogous simulations of matching duration movements in channel trials (S1 Fig).

Most studies of motor learning study contextual switching in conjunction with dual adaptation by introducing participants to novel force fields or visuomotor rotations, for which they do not have pre-existing feedforward controllers. In turn, we typically see slow, simultaneous adaptation to applied perturbations, as well as context-induced switching of the memories after the transient learning phase is over. Importantly, for familiar tasks this switching is evoked immediately, without the need to re-learn the dynamics again on re-exposure. This is clearly seen on the second or later days after adapting to dual force fields [35, 79]. In this study, our main goal was to demonstrate that such contextual switching is also possible for feedback controllers, rather than to demonstrate gradual adaptation. Therefore, in our experimental design we consciously selected two tasks (stop or hit) that were not novel to our participants. While there remains a possibility that due to the dynamics of the vBot environment both tasks were different to stop or hit movements outside of the lab, and thus novel to participants, we always started our studies with the blocked schedule, and only then followed with the mixed schedule to make sure that the different baseline controllers are already available to our participants. An interesting control would be to first test the participants in the mixed schedule, followed by the blocked schedule. However, we believe such control would mainly test whether the two task choices were novel to participants or not, which is not the focal point of our study.

In summary, here we again demonstrate that time-to-target (which could be considered as one form of urgency) [20, 26, 39, 76, 80], and not position or velocity, act as a primary predictor for the feedback response intensity when the task goal is fixed. Moreover, when comparing multiple tasks, the time-to-target reference frame consistently separates the feedback control policies for these tasks—an outcome that fails when comparing two different controller gains within the position reference frame. While position within the movement, and velocity at the time of a perturbation, definitely influence the controller responses, our results clearly demonstrate that the effect of these variables on overall control may be somewhat exaggerated in the previous literature. For example, our participants produced temporal evolution of the responses to visual perturbations that neither paralleled the velocity, nor showed the typical variation with position (with peak responses achieved mid-movement), but could be explained by the time-to-target dependency that was predicted by OFC. In addition, participants were able to switch their feedback controller from one trial to another, demonstrating the principle of contextual switching for feedback control. Such switching, well known in feedforward control, further reinforces accumulating evidence of the shared connections between feedforward and feedback control. Most importantly, our results demonstrate that the visuomotor feedback control in humans not only follows the principles of optimal control for a singular task, but also selects an appropriate controller for that task upon presenting the relevant context.

Methods

Ethics statement

The study was approved by the Ethics Committee of the Medical Faculty of the Technical University of Munich. All participants have provided a written informed consent before participating in the study.

Experimental setup

Fourteen right-handed [81] human participants (age 21–29 years, 5 females) with no known neurological diseases and naïve to the purpose of the study took part in the experiment. Participants performed forward reaching movements either to a target (stop condition) or through the target (hit condition) while grasping the handle of a robotic manipulandum (vBOT, [82]) with their right hand, with their right arm supported on an air sled. Participants were seated in an adjustable chair and restrained using a four-point harness in order to limit the movement of the shoulder. A six-axis force transducer (ATI Nano 25; ATI Industrial Automation) measured the end-point forces applied by the participant on the handle. Position and force data were sampled at 1 kHz, while velocity information was obtained by differentiating the position over time. Visual feedback was provided via a computer monitor and mirror system, such that this system prevented direct vision of the hand and arm, and the virtual workspace appeared in the horizontal plane of the hand (Fig 5A). The exact timing of any visual stimulus presented to the participant was determined from the graphics card refresh signal.

Fig 5. Experimental setup.

Fig 5

A. Participants controlled a yellow cursor by moving a robotic handle. The cursor was projected via a screen-mirror system directly into the plane of the participant’s hand. Figure copyright 2008 Society for Neuroscience. B. Stop condition. Participants were instructed to reach with the cursor through a red line and stop within the blue target. Target perturbations were occasionally induced via target jumps of 2 cm laterally. C. Hit condition. Participants were instructed to reach through the red target and stop within the blue area. Target perturbations (2 cm laterally) were again induced on random trials. D. Visual feedback was presented after each trial. Participants were shown the workspace with the start position and the target still present. In addition, two indicators were displayed. A bar chart at the top-right part of the workspace scaled proportionally with the absolute peak velocity, and was green if the velocity was within the required range as indicated by two grey brackets. A horizontal bar indicating the actual forward location where this peak velocity was achieved was displayed between the start and target positions. This bar was green if the peak location matched the experimental requirements, indicated by two large rectangular blocks. If both location and peak amplitude criteria were successfully fulfilled, participants were rewarded with one point. If at least one of the two criteria was not fulfilled, the respective indicator turned red instead of green, and no point was provided. In both hit and stop experiments participants were instructed to move through the red workspace element and stop at the blue, and were rewarded with one point if they both intercepted the target and fulfilled both velocity requirements.

Participants controlled a yellow cursor (circle of 1.0 cm diameter) by moving the robotic handle. The centre position of this cursor in the virtual workspace always matched the physical position of the handle. Every experimental trial was initiated when the cursor was brought into the start position (grey circle of 1.6 cm diameter), which was located 20 cm in front of participants’ chest and centred with the body. When the cursor was within this start position, the circle changed from grey to white and the type of experimental trial was indicated by the presentation of a target. After a random delay, sampled from an exponential distribution with λ = 0.7 and truncated outside 1.0 s–2.0 s interval, a tone was played to indicate the start of the movement. If participants failed to leave the start position within 1000 ms after this tone, the procedure of the current trial was aborted and restarted.

Over the course of the experiment the participants were tasked to complete two types of movements: stop movements, where they were required to stop within the target (a circle of 1.2 cm diameter, located 25.0 cm in front of the start position) (Fig 5B), or hit movements, where they had to intercept the target without stopping, and instead stop in a designated stopping area (a blue rectangle, [width, height] = [15 cm, 4 cm], centred 5 cm beyond the target) (Fig 5C). The reaching movement was considered complete once the centre of the cursor was maintained for 600 ms either within the area of the target in stop trials, or within the stopping area in the hit trials. In addition, if the movement duration was longer than 4.0 s, the trial was timed-out and had to be repeated. After each trial, the participant’s hand was passively moved back to the start position by the vBOT, while the feedback of the previous trial was provided on screen (Fig 5D). All movements were self-paced, with short breaks provided every 208 trials, and a longer break (5–10 minutes) provided at the half-way point of the experiment.

Experimental paradigm

Participants performed reaching movements in four conditions—blocked stop, blocked hit, mixed stop and mixed hit—that were part of a single experiment. Across these conditions, participants were required to either reach to the target and stop there (the stop conditions), or to reach through the target and stop in the designated stopping area (hit conditions). In order to easily cue the distinction between the hit and stop conditions, the two types of trials had small visual differences. For the hit condition participants were presented with a red target (a red circle of 1.2 cm diameter) and a rectangular blue stopping area of dimensions 15 cm by 4 cm, centred 5 cm beyond the target (Fig 5C). For the stop condition participants were presented with a target that was otherwise identical to the target in hit condition, but was blue in colour, and with a horizontal, 15 cm wide red line, that was placed 3 cm before the target (Fig 5B). While this line had no functional interaction with the experiment, it allowed us to consistently instruct the participants to always perform reaching movements so that they intercept the red element in the workspace, and stop within the blue element.

In order to probe the visuomotor feedback responses of participants, during some reaching movements we briefly perturbed the target by shifting it 2.0 cm laterally for 250 ms before returning back to the original position (Fig 2A). These perturbed trials were always performed within the virtual mechanical channel, where participants were free to move along the line between the start position and the target, but were laterally constrained by a virtual viscoelastic wall with stiffness of 2 N/m and damping 4000 Ns/m [7, 40, 52]. As the perturbations were always task-irrelevant, this channel therefore did not obstruct participants to complete the trial. However, as participants still produced involuntary feedback responses due to the target shift, the virtual channel allowed us to record the forces that participants produced due to the perturbations and measure the intensities of the visuomotor feedback responses.

For each type of movement (i.e. hit or stop) there was a total of 11 different perturbations. Ten of these perturbations were cued during the reaching movement as participants crossed one of the five perturbation onset locations, equally spaced between the start position and the target position (4.2, 8.3, 12.5, 16.7, and 20.8 cm from the centre of the start position). At all of these five locations the target could either shift to the left or to the right. In addition, one zero-magnitude perturbation was also included, where the movement was simply performed within the channel without any target shift in order to probe the force profile of the natural movement. Finally, in addition to the perturbation trials we also included non-perturbed trials where participants simply reached towards the target without any target perturbation and without the virtual channel constraining the hand.

In order to present the different perturbations in a balanced manner, we combined different types of trials in blocks of 16 trials. One block of 16 trials contained 11 perturbed trials (5 perturbation onset locations x 2 directions, plus one neutral movement in the force channel), and 5 non-perturbed movements without the force channel. Each of the four experimental conditions consisted of 26 such blocks, with the order of trials fully randomised within each block, resulting in 416 trials per condition and 1664 trials overall.

In the first half of the experiment, participants were always presented with the two blocked-design conditions (blocked hit and blocked stop), with the order of the conditions balanced across the population of participants. That is, each participant started with 416 trials of stop trials, followed by 416 hit trials or vice-versa. In the second half of the experiment, the two final conditions—mixed hit and mixed stop—were presented in a pseudo-random order within the same blocks. While individual trials within mixed conditions were identical to the individual trials within the respective blocked conditions, they were now presented in a pseudo-randomised order. Specifically, the remaining 832 trials were divided into 26 blocks of 32 trials, with each block consisting of 16 hit and 16 stop trials fully randomised within this block. Such randomisation resulted in a percentage split where 52% of trials were presented after a condition switch, 26% of trials were presented after exactly one trial of the same condition, 12%—after exactly two trials of the same condition, and larger clusters with diminishing frequency.

Feedback regarding movement kinematics

In theory, the movements in hit condition could be interpreted as the movements, where the goal is to go through the via-point (the red target) and stop at the blue stopping area. As a result, such movements could simply be treated by participants as the stop movements with longer movement distance and a less restrictive target. Typically for such reaching movements, humans would produce a velocity profile that is bell-shaped, with peak velocity near the middle of the movement, and therefore further along the movement than in the stop condition. In order to avoid such differences and keep the velocity profiles comparable between the two conditions, we provided the task-relevant feedback on the velocity profiles, specifically the peak velocity and peak velocity location, to our participants (Fig 5D).

Independent of the experimental condition, participants were required to produce the movements with the peak velocity of 60 cm/s ± 8 cm/s, and the peak velocity location within 11.25 cm–13.75 cm movement distance (or 45%-55% of the distance between the start location and the target). The peak velocity was indicated as the small bar chart at the top-right of the screen, with the required velocity range indicated by two grey brackets. If the velocity target was matched, the bar chart turned green, otherwise it was red. Similarly, the peak velocity location was shown as a horizontal bar, centred around the movement distance where the peak velocity was reached. If this location was within the target range (also indicated by grey brackets), it was displayed as green, otherwise it was red. Participants were rewarded one point if both velocity requirements were successfully met, and the cursor intercepted the target during the movement.

Data analysis and code availability

All data was pre-processed for the analysis in MATLAB 2017b: force and kinematic time series were low-pass filtered with a tenth-order zero-phase-lag Butterworth filter with 15 Hz cutoff and resampled at 1 kHz to account for an occasional missed sample during the signal recording. All subsequent analysis was performed in Python 3.9.4 and JASP v0.14.1 [83]. First, raw visuomotor feedback intensities were calculated from the force responses, recorded after the induction of a target perturbation. Specifically, for every perturbation trial we averaged the lateral force response over a time window of 180 ms–230 ms after the onset of the perturbation, and subtracted a neutral force profile over the matching time window. This method and the particular time window has now been used in numerous studies to calculate the intensity of the early involuntary visuomotor feedback response [7, 25, 26, 40, 41, 50, 51]. As the direction of the response differed based on the perturbation direction, we reversed the direction of the intensities of responses to the leftward perturbations, so that positive intensities always indicate movements in correct direction, and grouped all intensities by the perturbation onset location. Second, we normalised mean feedback responses between 0 and 1 for each participant in order to avoid the group effect being biased towards participants with stronger responses. Finally, in our analysis the start of all movements was defined as the last time sample where the cursor is still within the area of the start circle, and the end of the movement was defined as the last time sample before the cursor enters the target circle. Time-to-target values were extracted from the data for every perturbation trial by subtracting the perturbation onset time from the movement end time.

In this article we provide two types of statistical analysis: the conventional frequentist statistics, as well as complementary Bayesian analysis that is presented as Bayesian factors [53], which instead of a simple hypothesis testing provides evidence for or against the null hypothesis. As a result, among other things, Bayesian analysis allows us to distinguish between accumulating evidence for the null hypothesis, and simply lacking evidence in either direction due to low power or small sample size.

All the Jupyter notebooks for the data analysis, pre-processed experimental data and statistical analysis conducted in this article are available at https://doi.org/10.6084/m9.figshare.17113904.v1.

Computational modelling

In this study we formulated our initial hypothesis about the feedback control mechanisms in humans by first simulating the behaviour of the optimal feedback controller (OFC). Specifically, we used a finite-horizon linear-quadratic regulator framework—a relatively simple OFC that assumes perfect sensory input, as well as no control-dependent noise, while still being able to capture a significant part of the variance of human reaching movements [38, 84]. In order to model the feedback behaviour of our human participants, we first simulated virtual movements of a point mass with m = 1 kg, and an intrinsic muscle damping b = 0.1 Ns/m. This point mass was controlled in two dimensions by two orthogonal force actuators that simulated muscles, and regulated by a control signal ut via a first-order low-pass filter with a time constant τ = 0.06 s. At time t within the movement, such system could be described by the state transition model:

xt+1=Axt+B(ut+ξt), (1)

where A is a state transition matrix, B is a control matrix, and ξt is additive control noise. For one spacial dimension A and B are defined in discrete time as:

A=[1δt001-bδt/mδt/m001-δt/τ],
B=[0δt/τ0]

Finally, to simulate our model in discrete time we used the sampling rate δt = 0.01 s.

State xt exists in the Cartesian plane and consists of position p, velocity v and force f (two dimensions each). The control signal ut is produced via the feedback control law:

ut=-Lxt (2)

where L is a matrix of optimal feedback control gains, obtained by optimising the performance index (also known as the cost function):

J=t=0NxtTQtxt+utTRtut=t=0Nωp,t(pt-p*)2+ωv,tvt2+ωf,tft2+ωr,tut2. (3)

Here xtTQxt and utTRut are two components of the total cost, known as state-cost and a control-cost respectively. In addition, ωp, ωv and ωf are position, velocity and force state cost parameters, p* is a target position, ωr is the activation cost parameter and N is the duration of the movement, here required as a model input. Within the finite-horizon formulation, the cost parameters can be non-stationary and thus be different for every time-point. However, in our simulations we set Q = 0 for tN, consistent with [37, 85].

In this study we simulate three different controllers that we call stop, hit and long-stop. While the stop and long-stop controllers are derived from the identical set of costs state-costs Q, they are used for slightly different movements (25 cm and 700 ms for stop, 28 cm and 800 ms for long-stop). We used ωp = 1.5, ωv = 1 and ωf = 0.1 as the values for the state cost parameters in this model, and the activation cost R = 3 × 10−6. Furthermore, in order to better match the forward velocity profiles, we also introduced a non-stationarity in the activation cost R of the long stop movement, where the total integral of the activation cost over the movement is not changed, but this cost develops over time during the movement. Specifically, at a time t in the trial, the activation cost for the long-stop movement was computed by:

Rlong-stop(t)=RC(t), (4)

where

C(t)exp(pt+qr), (5)

and the mean of C(t) equals 1 for the duration of the trial, so that Rlong-stop produces the same amount of activation as R over the duration of the trial. Here p = 1, q = -1000 and r = 65 are constants, fit via trial and error in order to produce the forward velocity profile of long-stop condition that matches the velocity of stop and hit conditions. We have previously shown that such modulation only affects the kinematics of the movement, but does not change the feedback responses when expressed against the time-to-target [38]. On the other hand, in order to incentivise the hit controller to produce faster movements at the target, we reduced the cost parameters for terminal velocity and terminal force by a factor of 50. As a result, such controller produced hit-like movements that were aimed directly at a target, positioned at 25 cm distance, over 620 ms, which matched the kinematics of the long-stop controller over this movement segment.

Finally, for each controller we simulated feedback response intensity profiles along the movement, which we then used to compare the control policies predicted by each controller. To do so, we induced lateral target perturbations of 2 cm magnitude during the simulated movement to the target and recorded the corrective force, produced by each controller as a result of these perturbations. While in the experimental study we only induced such perturbations at five different onsets due to practical reasons, in our simulations we could perturb the movements at every point in time and fully map the response intensity profiles over the movement. Thus, for each model we simulated different movements with perturbations at each movement time-step (i.e. every 10 ms), with one perturbation only happening once per movement. In addition, to simulate the visuomotor delay that is present in humans, we delayed the onset of each perturbation by 150 ms, so that for the perturbation triggered at time t, the target is shifted at time t + 150 ms. We then averaged the force, produced by our model over a time interval 10 ms–60 ms after the target was shifted (160 ms–210 ms after the perturbation was triggered), representing the visuomotor response window of 180 ms–230 ms in human subjects. Note that we used an earlier window for the model simulations than for the human subjects as the responses in the simulations ramp up fast due to muscles simplified to a single low pass filter.

Supporting information

S1 Text. Initial learning of feedback controllers.

Here we verify whether our participants developed different feedback controllers for hit and stop tasks over the blocked schedule, or if these controllers were innate. To do so, we analyse the visuomotor responses in the first few blocks of the study, showing that these responses can be considered innate.

(PDF)

S2 Text. Effect of condition clustering in mixed schedule.

Visuomotor responses were analysed in trials immediately following the condition switch (hit to stop or stop to hit) in mixed schedule. Analysis shows same regulation as in the entirety of the mixed schedule, implying rapid switching.

(PDF)

S1 Fig. OFC model simulations in channel trials.

Model simulations performed in cbannel trials, instead of free movement responses. A. Velocity profiles for stop (blue), hit (red) and long-stop (green) conditions. B. Model simulations of feedback intensities as a function of time-to-target and C. position for the three conditions. Simulations in the channel trials qualitatively predict the same regulation as do the simulations of free movements.

(EPS)

Acknowledgments

We thank Hanna Hoogen, Isabelle Hoxha and Oliver Gerke for contributions to preliminary projects related to this manuscript. We thank Clara Günter, Jing Zhang, Sae Franklin, and Marion Forano for their feedback on this manuscript.

Data Availability

All the Jupyter notebooks for the data analysis, pre-processed experimental data and statistical analysis conducted in this article are available at https://doi.org/10.6084/m9.figshare.17113904.v1.

Funding Statement

The author received no specific funding for this work.

References

  • 1. Houk JC. An Assessment of Stretch Reflex Function. Progress in Brain Research. 1976;44(C):303–314. doi: 10.1016/S0079-6123(08)60741-4 [DOI] [PubMed] [Google Scholar]
  • 2. Pruszynski JA, Scott SH. Optimal feedback control and the long-latency stretch response. Experimental Brain Research. 2012;218(3):341–359. doi: 10.1007/s00221-012-3041-8 [DOI] [PubMed] [Google Scholar]
  • 3. Dimitriou M. Enhanced Muscle Afferent Signals during Motor Learning in Humans. Current Biology. 2016;26(8):1062–1068. doi: 10.1016/j.cub.2016.02.030 [DOI] [PubMed] [Google Scholar]
  • 4. Barr CC, Schultheis LW, Robinson DA. Voluntary, non-visual control of the human vestibulo-ocular reflex. Acta Oto-Laryngologica. 1976;81(5-6):365–375. doi: 10.3109/00016487609119974 [DOI] [PubMed] [Google Scholar]
  • 5. Tabak S, Collewijn H. Human vestibulo-ocular responses to rapid, helmet-driven head movements. Experimental Brain Research. 1994;102(2):367–378. doi: 10.1007/BF00227523 [DOI] [PubMed] [Google Scholar]
  • 6. Prablanc C, Martin O. Automatic control during hand reaching at undetected two-dimensional target displacements. Journal of Neurophysiology. 1992;67(2):455–469. doi: 10.1152/jn.1992.67.2.455 [DOI] [PubMed] [Google Scholar]
  • 7. Franklin DW, Wolpert DM. Specificity of Reflex Adaptation for Task-Relevant Variability. Journal of Neuroscience. 2008;28(52):14165–14175. doi: 10.1523/JNEUROSCI.4406-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Izawa J, Shadmehr R. On-Line Processing of Uncertain Information in Visuomotor Control. Journal of Neuroscience. 2008;28(44):11360–11368. doi: 10.1523/JNEUROSCI.3063-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Knill DC, Bondada A, Chhabra M. Flexible, Task-Dependent Use of Sensory Feedback to Control Hand Movements. Journal of Neuroscience. 2011;31(4):1219–1237. doi: 10.1523/JNEUROSCI.3522-09.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Reynolds RF, Day BL. Direct visuomotor mapping for fast visually-evoked arm movements. Neuropsychologia. 2012;50(14):3169–3173. doi: 10.1016/j.neuropsychologia.2012.10.006 [DOI] [PubMed] [Google Scholar]
  • 11. Franklin S, Wolpert DM, Franklin DW. Visuomotor feedback gains upregulate during the learning of novel dynamics. Journal of Neurophysiology. 2012;108(2):467–478. doi: 10.1152/jn.01123.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Pruszynski JA, Gribble PL, Corneil BD. A rapid visuomotor response on the human upper limb is selectively influenced by implicit motor learning. Journal of Neurophysiology. 2018;121(1):85–95. doi: 10.1152/jn.00720.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Zhang Y, Brenner E, Duysens J, Verschueren S, Smeets JBJ. Postural responses to target jumps and background motion in a fast pointing task. Experimental Brain Research. 2018;236(6):1573–1581. doi: 10.1007/s00221-018-5222-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Saijo N, Murakami I, Nishida S, Gomi H. Large-Field Visual Motion Directly Induces an Involuntary Rapid Manual Following Response. Journal of Neuroscience. 2005;25(20):4941–4951. doi: 10.1523/JNEUROSCI.4143-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Baram Y, Miller A. Auditory feedback control for improvement of gait in patients with Multiple Sclerosis. Journal of the Neurological Sciences. 2007;254(1-2):90–94. doi: 10.1016/j.jns.2007.01.003 [DOI] [PubMed] [Google Scholar]
  • 16. Oscari F, Secoli R, Avanzini F, Rosati G, Reinkensmeyer DJ. Substituting auditory for visual feedback to adapt to altered dynamic and kinematic environments during reaching. Experimental Brain Research. 2012;221(1):33–41. doi: 10.1007/s00221-012-3144-2 [DOI] [PubMed] [Google Scholar]
  • 17. Saunders JA, Knill DC. Humans use continuous visual feedback from the hand to control fast reaching movements. Experimental Brain Research. 2003;152(3):341–352. doi: 10.1007/s00221-003-1525-2 [DOI] [PubMed] [Google Scholar]
  • 18. Saunders JA, Knill DC. Humans use continuous visual feedback from the hand to control both the direction and distance of pointing movements. Experimental Brain Research. 2005;162(4):458–473. doi: 10.1007/s00221-004-2064-1 [DOI] [PubMed] [Google Scholar]
  • 19. Franklin S, Wolpert DM, Franklin DW. Rapid visuomotor feedback gains are tuned to the task dynamics. Journal of Neurophysiology. 2017; p. jn.00748.2016. doi: 10.1152/jn.00748.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Oostwoud Wijdenes L, Brenner E, Smeets JBJ. Fast and fine-tuned corrections when the target of a hand movement is displaced. Experimental Brain Research. 2011;214(3):453–462. doi: 10.1007/s00221-011-2843-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Oostwoud Wijdenes L, Brenner E, Smeets JBJ. Comparing online adjustments to distance and direction in fast pointing movements. Journal of Motor Behavior. 2013;45(5):395–404. doi: 10.1080/00222895.2013.815150 [DOI] [PubMed] [Google Scholar]
  • 22. Day BL, Lyon IN. Voluntary modification of automatic arm movements evoked by motion of a visual target. Experimental Brain Research. 2000;130(2):159–168. doi: 10.1007/s002219900218 [DOI] [PubMed] [Google Scholar]
  • 23. Diedrichsen J. Optimal Task-Dependent Changes of Bimanual Feedback Control and Adaptation. Current Biology. 2007;17(19):1675–1679. doi: 10.1016/j.cub.2007.08.051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Maeda RS, Cluff T, Gribble PL, Pruszynski JA. Feedforward and feedback control share an internal model of the arm’s dynamics. Journal of Neuroscience. 2018;38(49):10505–10514. doi: 10.1523/JNEUROSCI.1709-18.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Cross KP, Cluff T, Takei T, Scott SH. Visual Feedback Processing of the Limb Involves Two Distinct Phases. Journal of Neuroscience. 2019;39(34):6751–6765. doi: 10.1523/JNEUROSCI.3112-18.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Česonis J, Franklin DW. Time-to-target simplifies optimal control of visuomotor feedback responses. eNeuro. 2020;7(2):1–17. doi: 10.1523/ENEURO.0514-19.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Shadmehr R, Brashers-Krug T, Mussa-Ivaldi FA. Interference in Learning Internal Models of Inverse Dynamics in Humans. Advances in Neural Information Processing Systems 7. 1995;7:1117–1124. [Google Scholar]
  • 28. Bock O, Schneider S, Bloomberg J. Conditions for interference versus facilitation during sequential sensorimotor adaptation. Experimental Brain Research. 2001;138(3):359–365. doi: 10.1007/s002210100704 [DOI] [PubMed] [Google Scholar]
  • 29. Sing GC, Smith MA. Reduction in learning rates associated with anterograde interference results from interactions between different timescales in motor adaptation. PLoS Computational Biology. 2010;6(8). doi: 10.1371/journal.pcbi.1000893 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Howard IS, Wolpert DM, Franklin DW. The effect of contextual cues on the encoding of motor memories. Journal of Neurophysiology. 2013;109(10):2632–2644. doi: 10.1152/jn.00773.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Forano M, Franklin DW. Timescales of motor memory formation in dual-adaptation. PLoS Computational Biology. 2020;16(10):1–33. doi: 10.1371/journal.pcbi.1008373 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Hirashima M, Nozaki D. Distinct motor plans form and retrieve distinct motor memories for physically identical movements. Current Biology. 2012;22(5):432–436. doi: 10.1016/j.cub.2012.01.042 [DOI] [PubMed] [Google Scholar]
  • 33. Howard IS, Ingram JN, Franklin DW, Wolpert DM. Gone in 0.6 seconds: The encoding of motor memories depends on recent sensorimotor states. Journal of Neuroscience. 2012;32(37):12756–12768. doi: 10.1523/JNEUROSCI.5909-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Sarwary AME, Stegeman DF, Selen LPJ, Medendorp WP. Generalization and transfer of contextual cues in motor learning. Journal of Neurophysiology. 2015;114(3):1565–1576. doi: 10.1152/jn.00217.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Howard IS, Wolpert DM, Franklin DW. The value of the follow-through derives from motor learning depending on future actions. Current Biology. 2015;25(3):397–401. doi: 10.1016/j.cub.2014.12.037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Sheahan HR, Franklin DW, Wolpert DM. Motor Planning, Not Execution, Separates Motor Memories. Neuron. 2016;92(4):773–779. doi: 10.1016/j.neuron.2016.10.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Liu D, Todorov E. Evidence for the Flexible Sensorimotor Strategies Predicted by Optimal Feedback Control. Journal of Neuroscience. 2007;27(35):9354–9368. doi: 10.1523/JNEUROSCI.1110-06.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Česonis J, Franklin DW. Mixed-horizon optimal feedback control as a model of human movement. Neurons, Behavior, Data analysis, and Theory. 2021; p. 1–36. [Google Scholar]
  • 39. Poscente SV, Peters RM, Cashaback JGA, Cluff T. Rapid Feedback Responses Parallel the Urgency of Voluntary Reaching Movements. Neuroscience. 2021;. doi: 10.1016/j.neuroscience.2021.07.014 [DOI] [PubMed] [Google Scholar]
  • 40. Dimitriou M, Wolpert DM, Franklin DW. The Temporal Evolution of Feedback Gains Rapidly Update to Task Demands. Journal of Neuroscience. 2013;33(26):10898–10909. doi: 10.1523/JNEUROSCI.5669-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Franklin DW, Reichenbach A, Franklin S, Diedrichsen J. Temporal Evolution of Spatial Computations for Visuomotor Control. Journal of Neuroscience. 2016;36(8):2329–2341. doi: 10.1523/JNEUROSCI.0052-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Nashed JY, Crevecoeur F, Scott SH. Influence of the behavioral goal and environmental obstacles on rapid feedback responses. Journal of Neurophysiology. 2012;108(4):999–1009. doi: 10.1152/jn.01089.2011 [DOI] [PubMed] [Google Scholar]
  • 43. Orban de Xivry JJ. Trial-to-Trial Reoptimization of Motor Behavior Due to Changes in Task Demands Is Limited. PLoS ONE. 2013;8(6). doi: 10.1371/annotation/2aa90b9a-1634-4ab7-b5e3-a601b2326bb8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Lowrey CR, Nashed JY, Scott SH. Rapid and flexible whole body postural responses are evoked from perturbations to the upper limb during goal-directed reaching. Journal of Neurophysiology. 2017;117(3):1070–1083. doi: 10.1152/jn.01004.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. de Comite A, Crevecoeur F, Lefèvre P. Online modification of goal-directed control in human reaching movements. Journal of neurophysiology. 2021;125(5):1883–1898. doi: 10.1152/jn.00536.2020 [DOI] [PubMed] [Google Scholar]
  • 46. Reichenbach A, Costello A, Zatka-Haas P, Diedrichsen J. Mechanisms of responsibility assignment during redundant reaching movements. Journal of Neurophysiology. 2013;109(8):2021–2028. doi: 10.1152/jn.01052.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Reichenbach A, Franklin DW, Zatka-Haas P, Diedrichsen J. A dedicated binding mechanism for the visual control of movement. Current Biology. 2014;24(7):780–785. doi: 10.1016/j.cub.2014.02.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Reichenbach A, Diedrichsen J. Processing reafferent and exafferent visual information for action and perception. Journal of Vision. 2015;15(8):1–12. doi: 10.1167/15.8.11 [DOI] [PubMed] [Google Scholar]
  • 49. Gallivan JP, Logan L, Wolpert DM, Flanagan JR. Parallel specification of competing sensorimotor control policies for alternative action options. Nature Neuroscience. 2016;19(2):320–326. doi: 10.1038/nn.4214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. de Brouwer AJ, Jarvis T, Gallivan JP, Flanagan JR. Parallel Specification of Visuomotor Feedback Gains during Bimanual Reaching to Independent Goals. Eneuro. 2017;4(2):ENEURO.0026–17.2017. doi: 10.1523/ENEURO.0026-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. de Brouwer AJ, Gallivan JP, Flanagan JR. Visuomotor feedback gains are modulated by gaze position. Journal of Neurophysiology. 2018;120(5):2522–2531. doi: 10.1152/jn.00182.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Scheidt RA, Reinkensmeyer DJ, Conditt MA, Rymer WZ, Mussa-Ivaldi FA. Persistence of motor adaptation during constrained, multi-joint, arm movements. Journal of neurophysiology. 2000;84(2):853–862. doi: 10.1152/jn.2000.84.2.853 [DOI] [PubMed] [Google Scholar]
  • 53. Raftery AE, Kass RE. Bayes Factors. Journal of the American Statistical Association. 1995;90(430):773–795. doi: 10.1080/01621459.1995.10476572 [DOI] [Google Scholar]
  • 54. Harris CM, Wolpert DM. Signal-dependent noise determines motor planning. Nature. 1998;394:780. doi: 10.1038/29528 [DOI] [PubMed] [Google Scholar]
  • 55. Todorov E, Jordan MI. Optimal feedback control as a theory of motor coordination. Nature Neuroscience. 2002;5(11):1226–1235. doi: 10.1038/nn963 [DOI] [PubMed] [Google Scholar]
  • 56. Verstynen T, Sabes PN. How Each Movement Changes the Next: An Experimental and Theoretical Study of Fast Adaptive Priors in Reaching. Journal of Neuroscience. 2011;31(27):10050–10059. doi: 10.1523/JNEUROSCI.6525-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Rigoux L, Guigon E. A Model of Reward- and Effort-Based Optimal Decision Making and Motor Control. PLoS Computational Biology. 2012;8(10). doi: 10.1371/journal.pcbi.1002716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Berniker M, Penny S. A normative approach to neuromotor control. Biological Cybernetics. 2019;113(1-2):83–92. doi: 10.1007/s00422-018-0777-7 [DOI] [PubMed] [Google Scholar]
  • 59. Wada Y, Kawabata Y, Kotosaka S, Yamamoto K, Kitazawa S, Kawato M. Acquisition and contextual switching of multiple internal models for different viscous force fields. Neuroscience Research. 2003;46(3):319–331. doi: 10.1016/S0168-0102(03)00094-4 [DOI] [PubMed] [Google Scholar]
  • 60. Lee JY, Schweighofer N. Dual Adaptation Supports a Parallel Architecture of Motor Memory. Journal of Neuroscience. 2009;29(33):10396–10404. doi: 10.1523/JNEUROSCI.1294-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Howard IS, Ford C, Cangelosi A, Franklin DW. Active lead-in variability affects motor memory formation and slows motor learning. Scientific Reports. 2017;7(1):1–12. doi: 10.1038/s41598-017-05697-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Forano M, Schween R, Taylor JA, Hegele M, Franklin DW. Direct and indirect cues can enable dual adaptation, but through different learning processes. Journal of Neurophysiology. 2021;126(5):1490–1506. doi: 10.1152/jn.00166.2021 [DOI] [PubMed] [Google Scholar]
  • 63. Wolpert DM, Kawato M. Multiple paired forward and inverse models for motor control. Neural Networks. 1998;11(7-8):1317–1329. doi: 10.1016/S0893-6080(98)00066-5 [DOI] [PubMed] [Google Scholar]
  • 64. Ahmadi-Pajouh MA, Towhidkhah F, Shadmehr R. Preparing to reach: Selecting an adaptive long-latency feedback controller. Journal of Neuroscience. 2012;32(28):9537–9545. doi: 10.1523/JNEUROSCI.4275-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Hayashi T, Yokoi A, Hirashima M, Nozaki D. Visuomotor Map Determines How Visually Guided Reaching Movements are Corrected Within and Across Trials. eNeuro. 2016;3(3):1–13. doi: 10.1523/ENEURO.0032-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Maeda RS, Gribble PL, Pruszynski JA. Learning New Feedforward Motor Commands Based on Feedback Responses. Current Biology. 2020;30(10):1941–1948.e3. doi: 10.1016/j.cub.2020.03.005 [DOI] [PubMed] [Google Scholar]
  • 67. Voudouris D, Fiehler K. Dynamic temporal modulation of somatosensory processing during reaching. Scientific Reports. 2021;11(1):1–12. doi: 10.1038/s41598-021-81156-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Wagner MJ, Smith MA. Shared internal models for feedforward and feedback control. Journal of Neuroscience. 2008;28(42):10663–10673. doi: 10.1523/JNEUROSCI.5479-07.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Maeda RS, Kersten R, Pruszynski JA. Shared internal models for feedforward and feedback control of arm dynamics in non-human primates. European Journal of Neuroscience. 2021;53(5):1605–1620. doi: 10.1111/ejn.15056 [DOI] [PubMed] [Google Scholar]
  • 70. Cluff T, Scott SH. Rapid Feedback Responses Correlate with Reach Adaptation and Properties of Novel Upper Limb Loads. Journal of Neuroscience. 2013;33(40):15903–15914. doi: 10.1523/JNEUROSCI.0263-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Franklin S, Franklin DW. Feedback Gains modulate with Motor Memory Uncertainty. Neurons, Behavior, Data analysis, and Theory. 2021;5(2):1–28. doi: 10.51628/001c.22336 [DOI] [Google Scholar]
  • 72. Coltman SK, Gribble PL. Time course of changes in the long-latency feedback response parallels the fast process of short-term motor adaptation. Journal of Neurophysiology. 2020;124(2):388–399. doi: 10.1152/jn.00286.2020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Gritsenko V, Kalaska JF. Rapid online correction is selectively suppressed during movement with a visuomotor transformation. Journal of Neurophysiology. 2010;104(6):3084–3104. doi: 10.1152/jn.00909.2009 [DOI] [PubMed] [Google Scholar]
  • 74. Yousif N, Diedrichsen J. Structural learning in feedforward and feedback control. Journal of Neurophysiology. 2012;108(9):2373–2382. doi: 10.1152/jn.00315.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Kasuga S, Telgen S, Ushiba J, Nozaki D, Diedrichsen J. Learning feedback and feedforward control in a mirror-reversed visual environment. Journal of Neurophysiology. 2015;114(4):2187–2193. doi: 10.1152/jn.00096.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Oostwoud Wijdenes L, Van Beers RJ, Medendorp WP. Vestibular modulation of visuomotor feedback gains in reaching. Journal of Neurophysiology. 2019;122(3):947–957. doi: 10.1152/jn.00616.2018 [DOI] [PubMed] [Google Scholar]
  • 77. Izawa J, Rane T, Donchin O, Shadmehr R. Motor Adaptation as a Process of Reoptimization. Journal of Neuroscience. 2008;28(11):2883–2891. doi: 10.1523/JNEUROSCI.5359-07.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Crevecoeur F, Kurtzer I, Scott SH. Fast corrective responses are evoked by perturbations approaching the natural variability of posture and movement tasks. Journal of Neurophysiology. 2012;107(10):2821–2832. doi: 10.1152/jn.00849.2011 [DOI] [PubMed] [Google Scholar]
  • 79. Howard IS, Franklin S, Franklin DW. Asymmetry in kinematic generalization between visual and passive lead-in movements are consistent with a forward model in the sensorimotor system. PLoS ONE. 2020;15(1):1–21. doi: 10.1371/journal.pone.0228083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Crevecoeur F, Kurtzer I, Bourke T, Scott SH. Feedback responses rapidly scale with the urgency to correct for external perturbations. Journal of Neurophysiology. 2013;110(6):1323–1332. doi: 10.1152/jn.00216.2013 [DOI] [PubMed] [Google Scholar]
  • 81. Oldfield RC. The assessment and analysis of handedness: The Edinburgh inventory. Neuropsychologia. 1971;9(1):97–113. doi: 10.1016/0028-3932(71)90067-4 [DOI] [PubMed] [Google Scholar]
  • 82. Howard IS, Ingram JN, Wolpert DM. A modular planar robotic manipulandum with end-point torque control. Journal of Neuroscience Methods. 2009;181(2):199–211. doi: 10.1016/j.jneumeth.2009.05.005 [DOI] [PubMed] [Google Scholar]
  • 83.JASP Team. JASP (Version 0.14.1)[Computer software]; 2021. Available from: https://jasp-stats.org/.
  • 84. Kuo AD. An optimal control model for analyzing human postural balance. IEEE Transactions on Biomedical Engineering. 1995;42(1):87–101. doi: 10.1109/10.362914 [DOI] [PubMed] [Google Scholar]
  • 85. Todorov E. Stochastic optimal control and estimation methods adapted to the noise characteristics of the sensorimotor system. Neural Computation. 2005;17(5):1084–1108. doi: 10.1162/0899766053491887 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010192.r001

Decision Letter 0

Adrian M Haith, Samuel J Gershman

2 Feb 2022

Dear Mr Česonis,

Thank you very much for submitting your manuscript "Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllers" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

Overall, the reviewers were generally positive about the paper and found the paper to be technically sound. Reviewer 2, however, raised an important issue about the novelty of the work, pointing out several papers that have already provided evidence for the general claim that people are able to flexibly switch between different controllers in a task-dependent manner. I believe (and the reviewer seems to agree) there is sufficient novelty in the current paper to warrant publication. However, I agree with Reviewer 2 that the paper needs to better acknowledge previous studies that have already addressed the general question of switching between feedback controllers (and reached the same conclusion), and should more clearly distinguish the specific contribution of this paper in relation to that prior work.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Samuel Gershman

Deputy Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: This study has demonstrated that the motor system can flexibly recruit different feedback controllers depending on the type of task (hitting and stop tasks). It seems that previous studies have already demonstrated this. However, the authors pointed out that this was not true: The identical feedback controller could produce different feedback responses, and therefore observing different feedback responses is not solid evidence for the recruitment of different feedback controllers. Interestingly, their previous study (Cesonis and Franklin 2020) had demonstrated that the identical feedback controller induced the identical feedback responses when they were examined at the same time-to-target.

They experimentally demonstrated that different feedback responses were induced for the hitting and reaching tasks, even when the responses were evaluated at the same time-to-target and that this result was reproduced by the OFC models with different cost functions for both tasks. Furthermore, the schedule of tasks (i.e., block or mixed schedule) did not influence the result, enabling them to conclude that the motor system can flexibly and rapidly switch between different feedback control policies.

It is well-known that the motor system can create and retrieve different feedforward controllers when the appropriate cue is provided. This study is a significant contribution demonstrating that the motor system also has the ability to switch between different control policies depending on conditions. I was also intrigued by the idea that the feedback response at the same time-to-target can be used as a probe that examined the feedback controllers. Since this study was well-designed, I only have a few very minor issues.

1. Long-stop condition: This is an important control condition. The simulation result was presented, but the experiment was not performed in this condition. Do the authors expect that the experimental result would match the simulation?

2. Effect of schedule: I understood that there were no systematic differences between the blocked and mixed schedule (Fig.4). I do not oppose the idea of rapid switching, but I am curious about if the aftereffect from one condition to another did not exist. In the mixed schedule in which the hitting and stop tasks were randomly changed, one condition should continue several trials, which might obscure the effect of interference. The difference could be observed when analyzing the data in the trials only after switching?

3. Fig.2.B, C and 3E, F: The intensity was likely to be calculated as an averaged feedback response (i.e., force) in 180ms and 230ms from the perturbation onset. There seemed some inconsistencies between Fig.2 and 3. The force output shown in Fig.2 was approximately 0.3N for the earliest perturbation, while the intensity in Fig.3 was 0.8N. Furthermore, the difference in the force output between both tasks (Fig.2B and C) is much greater than the difference in the intensity (Fig.3E, F).

4. line 117-118 on page 9: There are garbled characters before 1cm/s and 20cm/s.

5. Line 356 on page 12: “In a recent study” should be “A recent study”?

Reviewer #2: The paper by Cesonis and Franklin presents data about changes in control policies between trials that differed based on their instructions. In one set of trials, participants were instructed to reach and stop at a target. In another set of trials, they were instructed to hit the target. Based on the premise that these different instructions translate into flexible changes in control strategies, it is shown that participants indeed switched controller from one trial to the next. The paper is interesting and all technical aspects seem to have been performed well. But it its current form I cannot endorse the paper because it ignores or dismisses previous work on the same topic.

As is, the paper cannot be accepted because it is not novel. A first point is the switching of controllers across trials, which has long been known. In fact, it has been included in the experimental design of many previous studies. Without putting emphasis on it, Nashed and colleagues (2012) already analysed the difference between blocked and random design to highlight trial-by-trial changes in goal structure. Later, Lowrey and colleagues (J Neurophysiol, 117: 1070–1083, 2017) used a random design with trials towards a round target or a bar target randomly interleaved and highlighted task-dependent switching of controllers. Orban de Xivry (PloS one 8 (6), e66013) compared switching of controllers with a blocked or random schedule and found a switch albeit smaller. The switching per se is not new.

More striking was the effort to minimize our recent study investigating switching of controllers across and within trials by de Comité et al. (125 (5), 1883-1898), which again showed the same results. I find that the authors dismissed our previous findings. First, we included also a change in controller across trials in the absence of online perturbation of the target shape as a control condition, reproducing the results of Nashed, Lowrey and colleagues. Then, it is written “the change in control signals alone does not directly imply a change in the feedback controller”. Yet, in this previous study, the change in control was evoked by a change in the structure, not the state (for instance position and velocity) of the goal. The authors then argue vaguely that this change in control can result from a change in the state that includes the target representation. The statement is inaccurate at best, or misleading if intentional. I believe the authors have missed that, computationally, changing the structure of the target from a square to a bar corresponded to a change in task demand. The critique was unfair and the author did not give proper credit to this previous work.

Towards assessing the paper as objectively as possible, it must be recognized that computationally again, our previous experiment corresponded to manipulating the factor called wp,t in their equation (3), which either has two non-zero components (penalizing x, and y) or one (penalizing only y). What the authors have done is testing the same idea by manipulating the other factors weighting velocity and force in the cost function. It is an important and interesting contribution on its own, but it does test the same idea only with other state variables. As such it deserves publication provided that previous contributions are properly acknowledged. The change of controllers has been demonstrated before, this is not new, what is tested here is whether this result also applies to hit and stop conditions, which are believed to require a change in the controller not linked to target geometry but to the penalty on the final state. Both manipulations imply a change in cost-function that must translate into an update of feedback gains. A substantial reworking is necessary to make this aspect very clear and upfront.

Specific points:

I did not find any definition of the time-to-target in the data, apologies if I missed it. Since there is no variability in time-to-target of Fig. 4b, I am assuming that it is measured as the difference between the perturbation time and some time limit, but since the perturbation were triggered in position I failed to understand how there was no variability in this quantity.

The authors equate urgency and time-to-target, which is a bit hazardous because the urgency may refer to an instantaneous pressure signal, without predicting how long or how far in time will movements be completed.

With respect to computational modelling I had concerns about the comparison made between the forces measured experimentally in a force channel and the forces produced in simulations. Was there a simulation of a force channel? Is there an impact on the correspondence between measured and simulated variables? Pls. add justification to the paper.

I found that the overall narrative was sometimes unclear. It has been known for a long time that between blocks of trials with different instructions, humans can change their goal-directed policy. This in my view is the main contribution of the use of optimal feedback control. In contrast, in the literature on motor adaptation shows gradual changes in internal models in ABA learning designs, with unlearning and interferences. Thus, it seemed clear that internal models for reaching do not change as quickly as task-dependent policies. The case of motor adaptation requires special contextual cues that seem less important for task-related constraints. The analogy is also limited, as context-dependent motor adaptation is associated with different feedforward controllers, while in the case of change in task-demands the authors use a model without feedforward pass. If the results must be interpreted in light of context-dependent learning I admit I see more differences between these paradigms than what is suggested.

Respectfully submitted,

F. Crevecoeur

Reviewer #3: In this paper the authors investigate to what extent healthy participants can switch between feedback controllers across tasks. This issue is clearly of interest to the motor control community. Indeed, although previous work has shown that task-dependant switching of feedforward controllers was possible, to my knowledge this issue has never been addressed in the context of feedback controllers. By comparing how participants react to visual perturbations (i.e. unexpected lateral target jumps) when performing either a reaching (stop) or an intercepting (hit) task, the authors can probe that these two tasks rely on separate feedback controllers. Importantly, evidence for separate controllers can be found even when participants are required to alternate reaching and intercepting trials. The authors conclude that contextual switching of feedback controllers is possible (as previously observed for feedforward controllers). In general, I found the manuscript very well written, with clear objectives, using a careful/clever design, and offering a lot of technical details. Still, I believe that this manuscript can be improved in several places (see my comments below).

Major Comments

1) Methodology (line 593-596): The authors state that all participants (n=14) performed first the blocked-design conditions then performed the mixed-design conditions (alternation of tasks). I wonder to what extent this design has favored the emergence of separate feedback controllers. When a participant performs a block of 416 trials while executing the same task over and over, it seems adequate to develop a feedback controller that is specific to this task. When the participant is subsequently asked to alternate trials in one task with trials in the second task, one may assume that those preexisting (separate) controllers may facilitate the switching of controllers across tasks. Now it is less clear what would have happened if participants had been exposed first to the mixed-design schedule (rapid alternation of tasks). One may wonder whether the emergence of separate controllers would have been possible. Although, I feel confident with the observations reported by the authors, I believe this extra information would impact the extent of the conclusions formulated by the authors. Obviously, I understand that investigating this issue necessitate to collect extra participants (but one may argue that the current number of participants is on the low side), but if such material is not provided, the authors should at least acknowledge the limitations of their experimental design.

2) Data analysis: In Fig 4B feedback response intensities are provided in each task (hit and stop) and for each schedule (blocked vs. mixed). Visual inspection suggests that the difference across tasks is greater in the blocked design (red vs. dark blue curve) than in the mixed design (orange vs. light blue). I wonder if the authors performed some statistical analyses that addressed specifically this issue (apart from the lack of condition/schedule interaction), as a smaller disparity across controllers matches with one the hypotheses formulated initially. One would also appreciate further analyses of the mixed schedule to assess the presence of possible carry-over effects when switching across tasks. For instance, one could assess whether the gain of feedback responses in a stop/reach trial increases as the number of preceding hit/intercepting trials increases. A possible lack of carry-over effects across tasks would strengthen the conclusions made by the authors (i.e. fully independent feedback controllers allowing instantaneous switch).

3) Learning: As far I understand, all available (perturbed) trials were considered to assess the visuofeedback gains associated with each task. It is unclear to what extent these responses are modulated by prior experience with the task (and perturbation). It would be helpful to clarify whether these feedback controllers were preexisting/innate, or whether they slowly build up during practice with the task. One could imagine that during initial training, feedback controllers were rather similar for both tasks, but progressively diverged with further experience. Providing some information about the time course of feedback responses over the blocked schedule would be helpful. For instance, one could contrast responses in early and late trials. Obviously, I understand that the statistical power will drop, but clarifying how these feedback controllers appeared/emerged in the first placed seems critical.

Minor comments

Line 117-118: Remove the extra characters preceding the value of velocity

Line 330: Please consider that not all studies support the notion that feedback and feedforward mechanisms are necessarily shared (Abdelghani and Tweed 2010; Gritsenko and Kalaska 2010; Kasuga et al. 2015; Yousif and Diedrichsen 2012).

Line 418: The authors emphasized that “time-to-target” is a better reference frame to monitor feedback responses. This variable resonates with another variable “time-to-contact” (Tau) that is very popular in the field of ecological psychology. I am wondering if the authors considered a possible connection with this field, or this is just a fortuitous coincidence?

Line 505: Was there a rational for the number of participants (n=14)? Nowadays, more and more studies are considering sample size of at least 20 participants.

Line 536: In the intercepting task, people were given feedback to hit the target at a given speed, but were they given explicit instructions regarding the fact that their peak velocity had to coincide or not with the hit?

Line 598: Did you have any constraints regarding the randomization of task order? For instance, was it possible for participants to experience 3 (or possibly more) consecutive trials of the same task? Did you set a limit for this (i.e. maximum number of consecutive trials in the same task)? More information on this issue would be helpful in the method.

Line 631: I assume that the neutral force profile was obtained from the non-perturbed trials performed with the channel, correct? I presume this force profile was non monotonic (possibly velocity dependent), correct? How did you deal with the fact that movement time varied across trials?

Line 637: Although I do understand the rational for the normalization of force responses across participants, it does not seem to match with the unit employed in Figure 3 and 4. In both figures the intensity of feedback responses is presented in Newtons. Please clarify how the normalization procedure allows the preservation of feedback responses in Newtons.

Fig 1: The authors contrast two extreme situations: a single versus 2 independent controllers. One could imagine an intermediate scenario in which the single controller carries history dependent effect (see also my second main comment on this). Perhaps the authors could acknowledge this third option in the caption or somewhere in the text.

Fig 3A&D: For both tasks, peak velocity is about 10% smaller in the experimental data than in the model predictions (60 cm/s). Even if the predictions would probably be qualitatively similar, I wonder why the authors did not readjust the settings for the model predictions.

Figure 3B: It is a bit odd to provide a single label (B) to account for two graphs (time-to-target and time-to-movement end) as opposed to all the other graphs (one label per graph). I would suggest inserting an extra label for the neighboring graph (with final label reaching G instead of F).

Fig 3BC: The model predictions suggest that sometimes the intensity of feedback responses can be negative (below 0). This feature is not visible in the experimental data (see Fig 3E) which presumably makes sense given the way the force data were analyzed (as force direction did not seem to matter). Altogether I am left wondering how to reconcile these separate observations. Please clarify.

Fig 3E+Line 153: This graph clearly shows different feedback responses across tasks, and the authors capitalize on these differences. Apart from being consistent with optimal feedback control, can the authors provide a (simple) rational as to why, during the first part of the movement, feedback responses should be larger for the hit condition than for the stop one?

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010192.r003

Decision Letter 1

Adrian M Haith, Samuel J Gershman

5 Apr 2022

Dear Mr Česonis,

Thank you very much for submitting your manuscript "Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllers" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

The reviewers were generally very satisfied with the revisions to the manuscript. Reviewer 2 raised some remaining concerns, however. I don't think these are major issues, but I hope you might be able to address them (as well as those raised by Reviewer 3) by providing a response to the reviewers' concerns and revisions to the manuscript where these may be appropriate.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Samuel Gershman

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: I was satisfied with the revision. I have no further additional comments.

Reviewer #2: The authors have clarified their contribution in regard of previous work investigating changes in target structure and the paper has improved. There remain points of disagreement that I would recommend the authors to consider.

Regarding the impact of the channel I am surprised to see the arguments developed, visuomotor delays are closer to 100ms than 150ms, see for instance Izawa and Shadmehr (JNS, 2008) who documented the effect of a jump on acceleration. Regarding the stretch response, the statement seems inaccurate in my view because the smallest perturbation loads applied very early during reaching are known to evoke short and long-latency responses (J Neurophysiol, 2012, 107: 2821-2832). In all it seems that there can be a feedback-mediated response to the channel, which is not captured in the simulation since there was no difference with or without channel. I believe that this discrepancy between model and data could be expanded a bit. It is strange to argue that the experiments aimed at probing control while using channels, then use a model without channel to simulate control and verify that results in the channel are not different.

I did not follow the authors’ response regarding the difference between change in control and adaptation, and I did not see any addition to the manuscript about this point. An indirect argument is made about the fact that the task was not novel but to my knowledge, dual adaptation can be compromised during exposures longer than the training sessions used here. In addition, the authors suggest that it would be an interesting control to test first the random schedule. Indeed, and I would be willing to bet that it works just fine or to a slightly lesser extent. On the contrary dual adaptation builds up over lover periods of time, and visual cues (and others) have been shown ineffective in evoking different contexts or goals. These are clear differences and I suggest to discuss the limitations of the analogy with context-dependent adaptation in more detail.

Reviewer #3: I am generally satisfied with the responses/changes provided by the authors to my concerns. Specifically, I appreciated their effort to assess the presence of possible carry-over effects across tasks. Evidence for instantaneous switching is strengthened. I have only a few minor suggestions left to improve their manuscript.

1) Fig S1: Please state the nature of error bars (SEM or 95% confidence interval)

2) Fig S2: It I unclear why data on panel B is displayed in arbitrary unit, whereas it is displayed in raw unit (N) in panel C. The resolution of the Y axis in panel C is low compared to other panels (2 versus 6 values). Please accommodate graph C accordingly. As evoked previously for Fig S1, please state the nature of error bars (SEM or 95% confidence interval).

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Frederic Danion

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010192.r005

Decision Letter 2

Adrian M Haith, Samuel J Gershman

9 May 2022

Dear Mr Česonis,

We are pleased to inform you that your manuscript 'Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllers' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Samuel Gershman

Deputy Editor

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010192.r006

Acceptance letter

Adrian M Haith, Samuel J Gershman

6 Jun 2022

PCOMPBIOL-D-22-00004R2

Contextual cues are not unique for motor learning: Task-dependant switching of feedback controllers

Dear Dr Česonis,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Olena Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Initial learning of feedback controllers.

    Here we verify whether our participants developed different feedback controllers for hit and stop tasks over the blocked schedule, or if these controllers were innate. To do so, we analyse the visuomotor responses in the first few blocks of the study, showing that these responses can be considered innate.

    (PDF)

    S2 Text. Effect of condition clustering in mixed schedule.

    Visuomotor responses were analysed in trials immediately following the condition switch (hit to stop or stop to hit) in mixed schedule. Analysis shows same regulation as in the entirety of the mixed schedule, implying rapid switching.

    (PDF)

    S1 Fig. OFC model simulations in channel trials.

    Model simulations performed in cbannel trials, instead of free movement responses. A. Velocity profiles for stop (blue), hit (red) and long-stop (green) conditions. B. Model simulations of feedback intensities as a function of time-to-target and C. position for the three conditions. Simulations in the channel trials qualitatively predict the same regulation as do the simulations of free movements.

    (EPS)

    Attachment

    Submitted filename: Reviewer_responses.pdf

    Attachment

    Submitted filename: response_to-reviewers.pdf

    Data Availability Statement

    All the Jupyter notebooks for the data analysis, pre-processed experimental data and statistical analysis conducted in this article are available at https://doi.org/10.6084/m9.figshare.17113904.v1.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES