Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2022 Jun 8;18(6):e1009338. doi: 10.1371/journal.pcbi.1009338

An approximate stochastic optimal control framework to simulate nonlinear neuro-musculoskeletal models in the presence of noise

Tom Van Wouwe 1,*, Lena H Ting 2,3, Friedl De Groote 1
Editor: Adrian M Haith4
PMCID: PMC9176817  PMID: 35675227

Abstract

Optimal control simulations have shown that both musculoskeletal dynamics and physiological noise are important determinants of movement. However, due to the limited efficiency of available computational tools, deterministic simulations of movement focus on accurately modelling the musculoskeletal system while neglecting physiological noise, and stochastic simulations account for noise while simplifying the dynamics. We took advantage of recent approaches where stochastic optimal control problems are approximated using deterministic optimal control problems, which can be solved efficiently using direct collocation. We were thus able to extend predictions of stochastic optimal control as a theory of motor coordination to include muscle coordination and movement patterns emerging from non-linear musculoskeletal dynamics. In stochastic optimal control simulations of human standing balance, we demonstrated that the inclusion of muscle dynamics can predict muscle co-contraction as minimal effort strategy that complements sensorimotor feedback control in the presence of sensory noise. In simulations of reaching, we demonstrated that nonlinear multi-segment musculoskeletal dynamics enables complex perturbed and unperturbed reach trajectories under a variety of task conditions to be predicted. In both behaviors, we demonstrated how interactions between task constraint, sensory noise, and the intrinsic properties of muscle influence optimal muscle coordination patterns, including muscle co-contraction, and the resulting movement trajectories. Our approach enables a true minimum effort solution to be identified as task constraints, such as movement accuracy, can be explicitly imposed, rather than being approximated using penalty terms in the cost function. Our approximate stochastic optimal control framework predicts complex features, not captured by previous simulation approaches, providing a generalizable and valuable tool to study how musculoskeletal dynamics and physiological noise may alter neural control of movement in both healthy and pathological movements.

Author summary

Model-based simulations have advanced our insight in how movement is controlled, but computational limitations have prevented us from simultaneously considering the effects of nonlinear musculoskeletal dynamics and noise on neural control on movement strategies. Here we present a novel simulation framework that addresses this methodological gap and demonstrate its potential to predict features of motor control that have not been predicted previously. Our framework enables neural control mechanisms to be simulated and the optimal control strategy in the presence of noise to be computed. We demonstrate for the first time that muscle co-contraction–a motor strategy thought to be energetically costly–can help to reduce the effort required to stabilize standing postural control, even when feedback is present. We also demonstrate that multi-segmental models of the arm actuated by muscle can predict complex reach and perturbed reach trajectories that were not explained by simplified models. The ability to predict muscle activation patterns and kinematics of joints extends the theory of optimal control to variables that can be used to directly explain muscle and joint coordination. Our computational approach is broadly applicable and may be extendable to other normal and impaired movements, as well as aid in the design of assistive devices.

Introduction

Predictive simulations are powerful tools to study the neuromechanics of movement [1,2]. Movement simulations are typically based on optimal control as a theory of motor coordination to solve the redundancy problem, i.e. to determine which of the many possible movement strategies is used to achieve a given movement goal [3]. Optimal control theory has also been used to demonstrate how sensorimotor noise affects movement control strategies. However, current approaches focus either on detailed representations of musculoskeletal dynamics while neglecting physiological noise [46] or on simulating the effects of physiological noise while simplifying musculoskeletal dynamics [79]. On the one hand, deterministic simulations based on complex models have advanced our understanding of how musculoskeletal dynamics shapes movement. On the other hand, stochastic simulations based on simple models have shown that noise shapes movement kinematics and variability as well as underlying control mechanisms, including modulation of feedback. Accounting for the interaction between musculoskeletal dynamics and physiological noise may therefore be important to predict physiologically-realistic control strategies and movements. However, the limited efficiency of available computational tools to simulate motor behavior in the presence of noise has hampered the use of more accurate models of musculoskeletal dynamics in stochastic simulations of movement [10]. Here, we present and test a generalizable computational framework to simulate the effect of sensorimotor noise on motor control and movement in nonlinear musculoskeletal systems.

Accounting for sensorimotor noise in movement simulations is crucial to capture movement variability and sensorimotor feedback modulation. In 1998, Harris and Wolpert minimized endpoint variability in the presence of signal-dependent motor noise to simulate open-loop controlled reaching and saccadic eye movements [11]. Including signal-dependent motor noise led to physiologically realistic bell-shaped velocity profiles and the experimentally observed trade-off between reaching time and reaching accuracy, i.e., Fitts’ law. Although Harris and Wolpert clearly demonstrated the importance of accounting for noise, their stochastic simulations did not include feedback control. In general, feedback control improves performance over open-loop control in the presence of sensorimotor noise [12]. Therefore, Todorov and Jordan introduced feedback control in stochastic simulations of movement and tested optimal feedback control (OFC) as a theory of motor coordination [3,13]. The most important prediction of optimal feedback control is arguably the minimum intervention principle: “deviations from the average trajectory are only corrected when they interfere with the task goal” [13]. Optimal feedback control has since explained many kinematic and control features of reaching and standing balance [1417]. However, these studies mostly used linearized models that do not account for critical nonlinearities in movement.

Some studies have accounted for nonlinearities in musculoskeletal dynamics but relied on limiting assumptions on the control policy. For example, Li and Todorov [1820] introduced iterative linear-quadratic-Gaussian control (iLQG), which solves the nonlinear stochastic optimal control problem by iteratively updating feedforward and feedback controls based on a local linearization of the dynamics around the current state trajectory. However, iLQG requires the control law to be time-varying, which might not be physiologically realistic. In addition, it requires a quadratic cost function and it requires specifying movement accuracy by a penalty term in the cost function [18,21]. As a result, the weights in the cost function need to be hand-tuned to achieve realistic results and an obligatory trade-off between accuracy and effort that may not necessarily be physiological emerges. Stochastic optimal control simulations of movement based on nonlinear models have remained rare, possibly because of their high computational cost.

Recent computational advances have drastically improved the efficiency of deterministic movement simulations enabling the use of complex and nonlinear musculoskeletal models and their application to stochastic movement simulations opens perspectives to further improve the realism of movement simulations. The introduction of direct collocation and automatic differentiation greatly speeded up simulating deterministic movements enabling the use of complex and nonlinear musculoskeletal models. In deterministic simulations, optimal control can be described by open-loop control trajectories. Therefore, these simulations can be formulated as trajectory optimization problems [5]. The introduction of direct collocation approaches to solve trajectory optimization problems improved computational efficiency compared to shooting methods by decreasing the sensitivity of the optimization objective to the decision variables [22]. Shooting methods use time-marching integration to evaluate the cost and constraint functions based on the current guess of the initial state and controls. Due to time-marching integration over the entire movement horizon, shooting methods have poor convergence and long computation times when the dynamics are stiff, as in many biological movements. In contrast to direct shooting, direct collocation eliminates the need for time-marching integration by adding the parameterized states to the decision variables and by adding the discretized dynamic equations as constraints to the optimization problem. Computational efficiency has further been improved by implicit formulations of the system dynamics to improve the numerical condition of the optimization problem [23] and the use of automatic differentiation to compute derivative information needed by gradient-based solvers [24]. These methodological advances have enabled rapid predictive simulations based on complex musculoskeletal models [2,6], which have been applied to test optimality principles underlying human movement [4] and to study the effect of changes in the musculoskeletal system on movement [6,25].

Computational advances that have improved the efficiency of deterministic simulations have been leveraged to stochastic optimal control simulations by reformulating stochastic simulations as approximate deterministic simulations. This approximate reformulation was originally applied for control in robotics and engineering [26]. Houska et al. [26,27] and Gillis et al. [28] proposed to transform the stochastic optimal control problem into an approximate augmented deterministic optimal control problem by approximating the generally non-Gaussian state trajectory distribution by a Gaussian state trajectory that can be described by the mean state trajectory and the state covariance trajectory. The propagation of the covariance matrix is described by Lyaponuv differential equations, which assume local invariance of the system dynamics around the mean trajectory similar to the Extended Kalman Filter [29]. Berret et al. have applied this approach to simulations of reaching [30]. However, they only considered feedforward control [30] or added feedback control to the pre-computed optimal feedforward control in a post-processing step using the LQG framework [31]. This does not seem to yield an optimal solution as during the optimization of the feedforward control policy, the presence of feedback is neglected. It is therefore likely that co-optimizing feedforward and feedback contributions might result in better and more realistic control policies.

Here, we apply the approach based on approximating stochastic simulations as deterministic simulations to simultaneously compute feedforward and feedback contributions to control of nonlinear musculoskeletal dynamics in the presence of sensorimotor noise. We present a general formulation of the approach that is applicable to a broad range of movements, described by nonlinear dynamics, corrupted by additive and/or signal-dependent Gaussian noise, and controlled by time-varying feedback laws with any temporal and structural design, any type of cost function. Finally, our approach allows formulating movement accuracy as a task constraint, which may be more representative of the actual task-level goal.

We first show how our stochastic optimal control framework enables the prediction of interactions between muscle co-contraction and sensorimotor feedback, using human standing balance control as an example. While OFC has been used to predict sensorimotor feedback control of balance, individual muscles and muscle dynamics have not been modeled and it is therefore unclear whether OFC also predicts muscle co-contraction as a complementary feedforward control strategy. Specifically, OFC simulations based on joint torque-driven mechanical models capture modulation of feedback control with changes in sensory acuity [32,33], i.e. sensory reweighting. We hypothesized that OFC simulations will also capture modulation of feedforward co-contraction when history dependent muscle properties such as short-range-stiffness (SRS) [3436] are taken into account. We simulated perturbed (platform rotations and translations) standing balance based on a multi-sensory, muscle-driven model with both feedforward and feedback control in the presence of sensory and motor noise. We demonstrate that our stochastic optimal control framework predicts contributions of both feedforward, i.e. muscle co-contraction, and feedback control during standing balance that depend on movement task, sensory acuity and muscle properties.

We next demonstrate that the stochastic optimal control framework can predict perturbed reach trajectories as well as the underlying control policy consisting of feedforward and feedback contributions when nonlinear musculoskeletal dynamics are considered. Prior OFC simulations based on a point-mass model capture changes in nominal reach trajectories and feedback control depending on target shape and the presence of obstacles (i.e. task goal) [8]. However, perturbed reach trajectories deviated considerably from experimental observations. We hypothesized that more accurate representations of nonlinear multi-joint and muscle mechanics would result in more realistic reach trajectories. We simulated reaching while altering target shape and stability of the environment (i.e. divergent force-field) [37,38]. We demonstrate that our stochastic optimal control framework using a muscle-driven arm model results in improved predictions of reach kinematics in perturbed conditions as well as detailed predictions of sensorimotor feedback and muscle level control. In addition, our model predicted muscle activity in response to perturbations, which was not possible with previous simulation models, in agreement with experimental data [8].

Results

Approximate stochastic optimal control framework

We simulated movement trajectories and movement variability based on nonlinear musculoskeletal dynamics driven by optimal feedforward and feedback control policies in the presence of sensory and motor noise. For each specified task, optimal control policies were computed by minimizing the expected effort in the presence of noise. Effort was defined as the time integral of the sum of muscle excitations squared [39], leading to the following general stochastic optimal control problem:

mineff(t),K(t):J=E[tstarttfinaleT(t)e(t)dt]
subjectto:x˙(t)=f(x(t),e(t),wm),
g(x(t),e(t))0
e(t)=eff(t)+K(t)yfb(x(t),ws)

with E[] the expected value function, x(t) the stochastic state trajectory, including joint kinematics and muscle activations, e(t) the stochastic muscle excitation trajectories, wm a set of zero-mean Gaussian motor noise sources. The musculoskeletal dynamics f(x(t),e(t),wm) were stochastic and nonlinear. To specify different task goals, we imposed task-dependent path constraints and bounds g(x(t),e(t)). Muscle excitations e(t) consisted of time varying deterministic feedforward muscle excitations, eff(t), as well as feedback muscle excitations derived from a linear feedback law with deterministic time-varying feedback gains K(t) and task-dependent feedback error signals yfb(x(t), ws). We did not separately model short- and long-latency reflexes and considered task-level sensory feedback rather than local sensory feedback. Here, we modeled sensory noise ws as zero-mean Gaussian noise added to the feedback error signal yfb. These stochastic optimal control problems were approximated by deterministic optimal control problems and then solved using direct collocation (for details, see Methods). The deterministic approximation was based on the assumption that the stochastic state trajectories could be modelled by a Gaussian distribution and could thus be described by their expected value (mean trajectory) and variance (state covariance matrix).

Contributions of muscle co-contraction and feedback control in eyes-closed perturbed standing balance depend on movement task, sensory acuity and muscle properties

We first demonstrate that stochastic optimal control can simultaneously predict muscle co-contraction and sensorimotor feedback contributions to motor coordination, using eyes-closed perturbed standing balance as an example. Although OFC can capture experimentally identified modulations of feedback contributions that depend on sensory acuity and perturbation type [33,4042], it is unclear whether OFC predicts experimentally-observed muscle co-contraction complementing feedback during perturbed standing balance. To address limitations of prior models, we simulated standing balance with eyes closed using an inverted pendulum model of the body that was driven by a pair of antagonistic ankle muscles that have activation-dependent impedance (Fig 1A). We further investigated how the predicted muscle co-contraction depends on the model of muscle mechanical impedance, performing simulations using both a Hill-type muscle model that accounts for the force-length and force-velocity properties of muscles [22], and an augmented Hill-type muscle that also accounts for muscle short-range stiffness that was proportional to muscle activation (see Methods) [43,44]. To model baseline muscle activity during quiet standing, feedforward muscle excitations were modeled as constants. As in prior studies using torque-driven models of perturbed standing balance [15,32,45], feedback contributions were modelled as a linear combination of delayed (first order approximation with time constant of 150ms, see Methods) proprioceptive and vestibular cues, which encode the angle and angular velocity between the body and the platform, and gravity, respectively (Fig 1A). Since we simulated the eyes-closed condition our model assumes that no visual information is used for control. Gaussian sensory and motor noise was added to feedback signals and muscle excitations respectively. In accordance with the literature, vestibular inputs were more noisy than proprioceptive inputs [46]. We specified the task of upright standing by imposing a constant mean upright posture and a postural sway within limits of stability. To elicit a range of feedforward and feedback control policies, we simulated both random sagittal platform rotation (Fig 1A) and translation (Fig 1B) perturbations of different magnitudes. Co-contraction was quantified by the co-contraction index (CCI) introduced by Rudolph et al. [47]. Comparing solutions for platform rotations and translations allows sensory feedback and muscle co-contraction contributions to balance control to be differentiated as rotation and translation perturbations require different ankle muscle coordination to maintain equilibrium [48] (Fig 1A and 1B).

Fig 1.

Fig 1

A) ROTATIONS. Musculoskeletal and control model. The proprioceptive (relative angle between the body and platform: qp,q˙p) and vestibular (absolute body angle: qv,q˙v) cues encode different information that is corrupted by proprioceptive (wp) and vestibular noise (wv). The platform rotations are modeled by Gaussian noise (wSS) of the platform position (qSS), velocity (q˙SS) and acceleration (q¨SS). amuscle are muscle activations; eff,muscle are muscle feedforward excitations that are constant here; Kp2x2 are proprioceptive feedback gains and Kv2x2 are vestibular feedback gains. Experiment and simulation. Experimental data for body sway and feedback gains are from Peterka et al. [32]. RMS body sway, relative experimental proprioceptive feedback gains K˜proprioK˜proprio+K˜vest+K˜vis and relative simulated proprioceptive feedback gains KproprioKproprio+Kvest, co-contraction index (CCI) and contribution of expected effort from feedback to the total expected effort: effortFB(effortFF+effortFB) for healthy and vestibular loss (VL) subjects with and without short-range-stiffness (SRS) modeled. B) TRANSLATIONS. Musculoskeletal model. The control model is identical to the model used to simulate the response to rotation perturbations, but in the case of translation perturbations the relative and absolute body angles are identical and vestibular and proprioceptive cues encode the same information. Platform translations are modeled by Gaussian noise (wtrans) of the linear platform acceleration (x¨SS). Simulation results. Simulation. Same outcome variables as for the rotation perturbations are shown.

For platform rotations, stochastic optimal control predicted changes in postural sway and sensory reweighting with perturbation magnitude in agreement with experimental observations in healthy individuals and vestibular loss subjects (Fig 1A–simulations & experiments) [32,49,50]. In agreement with experimental findings by Peterka for healthy adults [36] (Fig 1A–experiments, healthy) and prior OFC results, body sway in our simulations increased quasi-linearly with increasing amplitude of platform rotations and saturated at larger amplitudes where the body moved in anti-phase with the platform [32,51,52] (Fig 1A–simulations, healthy). The simulations predicted sensory reweighting similar to that observed experimentally [32,52] (Fig 1A–experiments, healthy), with increased reliance on vestibular feedback at higher rotation magnitudes (Fig 1A–simulations, healthy). Because proprioceptive information was modelled to be more accurate than vestibular information [48], there was a higher reliance on proprioception and the body moved in-phase with the platform at low rotation magnitudes. When removing vestibular sensory information to simulate vestibular loss subjects, the simulation predicted a quasi-linear increase in sway with platform rotation amplitude (Fig 1A–simulations, vestibular loss) consistent with experimental observations [52] (Fig 1A–experiments, vestibular loss). In the case of vestibular loss, the relative weighting of proprioceptive information was 100% in simulation (Fig 1A–simulations, vestibular loss) and nearly 100% in experiments (Fig 1A–experiments, vestibular loss) throughout platform rotations. Peterka et al. [52] determined sensory weighting by identification of the feedback gains K˜proprio,K˜vest,K˜vis of a linear model based on collected sway data, with K˜vest=0 for vestibular loss subjects. In this study [52], the identified K˜vis in the eyes-closed condition were close to but not exactly 0, explaining the small deviations from 100% proprioceptive weighting in this condition. In contrast to sway for healthy subjects, sway for vestibular loss subjects was predicted to follow the platform motion, as proprioception is the only source of sensory information [53]. The model predicted higher total effort in vestibular loss subjects compared to healthy subjects and loss of balance when simulated sway amplitudes became unrealistic (RMS sway values larger than 5°).

During platform rotations, slightly higher, but still low levels of co-contraction were predicted for vestibular loss subjects [54] than for healthy controls (Fig 1A–simulations, CCI in healthy with SRS and VL, model with short-range stiffness). In rotations, increased joint impedance due to co-contraction opposes the anti-phase movement of the ankle joint that is optimal for upright balance with minimal effort. However, co-contraction contributes to the strategy of maintaining a constant joint angle with respect to the platform, the strategy predicted when the reference to gravity is absent, as in vestibular loss subjects [32,49,55]. Our simulations predicted co-contraction to increase above a certain perturbation magnitude in the absence of vestibular sensory information (Fig 1A–simulations, CCI in VL, model with short-range stiffness). Since all simulated strategies minimize effort, our simulations predict muscles co-contraction to reduce effort with respect to using feedback only at high perturbation magnitudes in vestibular loss subjects. However, as feedback control also increases in the absence of vestibular information, the proportion of effort due to feedback control is nevertheless higher in vestibular loss than in healthy subjects (Fig 1A–simulations, effort in VL, model with short-range stiffness).

In contrast to rotations, during platform translations stochastic optimal control predicted similar increases in postural sway with increasing perturbation magnitude in healthy and vestibular loss simulations, also found experimentally [56] (Fig 1B—simulations). As perturbation magnitude increased, the proportion of proprioceptive feedback used in balance control decreased, but in contrast to rotations, shifted toward an equal contribution of proprioceptive and vestibular feedback. In translations¸ both sensory signals encode the same information but with different uncertainty levels, therefore this shift towards more equal contributions with increasing perturbation magnitude may be explained by the sensory uncertainty becoming increasingly small in comparison to the kinematic deviations introduced by the perturbations. The decreased contribution of proprioceptive feedback with increasing translation magnitude has not been tested experimentally and should be considered a model prediction, allowing further validation of our simulations.

In contrast to rotations, and consistent with experimental findings [57], much larger contributions of muscle co-contraction were predicted with increasing translation perturbation magnitude in both healthy and vestibular loss subjects (Fig 1A and 1B–simulations, CCI). In translations, increased joint impedance due to co-contraction reduced body sway, which in turn reduced the relative effort of feedback corrections as perturbation magnitude increased (Fig 1B–simulations, effort).

Taken together, stochastic optimal control predicted muscle co-contraction as a complementary strategy to sensorimotor feedback depending on the mechanical properties of the muscle, perturbation type and magnitude, and sensory acuity. Muscle co-contraction was only predicted in the simulations where the Hill-type muscle model was augmented with short-range-stiffness [43]. Although some amount of muscle co-contraction was predicted in both rotation and translation perturbations, muscle co-contraction only considerably decreased the proportion of muscle effort due to feedback control in high-amplitude translation perturbations, where muscle co-contraction helps to maintain upright posture.

Stochastic optimal control of goal-directed reaching predicts experimental kinematics, movement variability and feedback modulation across different reaching tasks

In a second set of simulations, we demonstrate that stochastic optimal control using a nonlinear muscle-driven arm model can predict how nominal as well as perturbed reaching trajectories change as a function of task goals and environmental dynamics. We simulated the three point-to-point reaching tasks (Fig 2A) described in Nashed et al. [8]: reaching to a circular target (circle), reaching to a horizontal bar (bar), and reaching to a circular target in the presence of a narrow obstacle (obstacle) (Fig 2A). Previously these reaching tasks were simulated using a point-mass model of the arm assuming OFC. To capture nonlinear inter-segment dynamics, we modeled the arm as a planar two-segment kinematic chain. To enable muscle activity predictions, the arm model was driven by six Hill-type muscles with rigid tendons consisting of a uni-articular agonist-antagonist pair for the shoulder (posterior and anterior deltoid) and the elbow (brachialis and lateral triceps), and a bi-articular agonist-antagonist pair spanning both the elbow and the shoulder (biceps and long triceps) [20]. Net muscle excitations were a sum of time-varying feedforward excitations and delayed (first order approximation with time constant of 150ms, see Methods) time-varying linear feedback of the end effector (hand) kinematic error [8,10]. Gaussian sensory noise and motor noise were added to the feedback signals and joint torques, respectively. The three different tasks were modeled by constraining the end effector position variability to achieve the task requirements. Optimal feedforward controls, feedback gains and reference end effector trajectories were computed by minimizing expected effort during unperturbed reaching, and the resulting optimal control policies were then used to generate stochastic forward simulations of both unperturbed and perturbed reaching. Perturbations in the perturbed reaching simulations were modelled as external torques inducing shoulder and elbow extension (Fig 2C).

Fig 2.

Fig 2

A: Schematic representation of the three reaching tasks and the musculoskeletal and control model. We simulated three different reaching tasks: (1) reaching towards a small circular target (blue), (2) reaching towards a horizontal bar (red), and (3) reaching towards a circular target in the presence of an obstacle (green). In all simulations, muscle excitations consist of feedforward and feedback contributions. The feedback controller is driven by the error between the end-effector kinematics (EE,EE˙) and the nominal end-effector kinematics (EEref,EE˙ref) and is corrupted by sensory noise. Muscle forces are dependent on muscle activations (amuscle), which are time-delayed muscle excitations, and resulting joint torques are corrupted by motor noise. B: Unperturbed and perturbed reaching trajectories predicted by our model (colored), predicted with a point-mass model (grey) and measured (black). End-point accuracy simulated with our model for the three different reaching tasks. Ellipses denote 95% confidence regions. C: Perturbations applied during the perturbed reaching movements. D: Simulated muscle-level corrective actions that result from optimal feedback in response to unexpected extension perturbations. For elbow and shoulder flexor muscles (anterior deltoid, biceps) we show experimental EMG data from Nashed et al. [9]. E: Average simulated muscle co-contraction level for the shoulder and elbow joints for the three reaching tasks.

Like predictions based on the point-mass model, predictions based on our muscle-driven, multi-segment arm model were in line with the minimum intervention principle [3,58] where kinematic deviations are only corrected when they interfere with the task goal: horizontal deviations are left uncorrected in the bar condition. All our optimal control policies met the required reaching end-point accuracy, for both perturbed and unperturbed reaching, as imposed for the different tasks in the stochastic optimal control problem formulations and in accordance with experimental data [8] (Fig 2B, reaching accuracy).

In contrast to point mass simulations, our non-linear muscle-driven model predicted both unperturbed and perturbed reaching trajectories that were more similar to experimental findings [8]. In unperturbed reaches, an important improvement over predictions from a point mass model are the slightly curved mean trajectories in the circle and bar task, as have been experimentally observed [11] (Fig 2B, compare colored with black and grey trajectories). In the obstacle task the reach trajectories are similar for the point mass and non-linear muscle driven simulations. Similar to the experiments, the non-linear muscle driven simulations show less curvature for the obstacle task compared to the circle task as the second part of the motion the hand is constrained to be on a straight vertical line in the simulation.

Perturbed reach trajectories using our multi-segment muscle-driven arm model were in better agreement with experimental observations than perturbed reach trajectories using a point mass model. In agreement with experiments, and in contrast to point mass models, our model predicted late and steep corrections in the kinematic trajectory for the circle task (Fig 2B colored lines). Our model similarly predicted later corrections occurring over a shorter reaching distance than the point mass model for the obstacle task. Yet, these corrections happened sooner than experimentally observed. For the bar task our model did not predict the overshoot of the hand in the vertical direction that was observed in experiments (Fig 2B perturbed trajectories, red and black lines). The early, symmetric corrections predicted by the point-mass simulations are likely due to the decoupled control of the vertical and horizontal degree of freedom. Given the similarity in the underlying control hypothesis between our simulations and the point-mass simulations of Nashed et al. [8], the improved agreement with experimentally observed reach trajectories indicates the importance of accounting for nonlinear dynamics when predicting reaching movements [58,59].

A novel aspect of our model is that muscle activations in response to perturbations could also be predicted and compared to available EMG recordings. Simulated corrective muscle activations were similar to recordings in the anterior deltoid and brachialis, with larger corrections in the ‘circle’ than in the ‘bar’ condition (Fig 2D, blue vs red). Simulated corrective muscle activations in the ‘obstacle’ (Fig 2D, green) condition were larger and peaked earlier than those in the circle and bar conditions. As EMG data was only available for the anterior deltoid and brachialis in the circle and bar conditions, simulated muscle activity of the antagonistic muscles and in the bar condition remains to be validated experimentally.

We found only limited muscle co-contraction with an average co-contraction index that was always below 0.006 (where 0 indicates no co-contraction and 2 indicates maximal co-contraction). The mean muscle co-contraction index was similar across task conditions for the uni-articular shoulder muscles and bi-articular muscles (Fig 2E). For the uni-articular elbow muscles co-contraction increased in the obstacle task. This was observed experimentally as well [8]. Note that we did not include short-range stiffness in our muscle model, which was necessary in the balance simulations to predict co-contraction, since short-range-stiffness contributions are negligible during movements over large ranges of motion [46]. The increase in co-contraction observed in the ‘obstacle’ condition will therefore have a small effect on joint impedance, suggesting that it follows from the altered feedforward trajectory needed to avoid the obstacle rather than from a shift towards feedback control to increase joint impedance.

Stiffness is modulated during goal-directed reaching in a divergent force field through changes in feedback but not feedforward control

To explore how the dynamics of the environment influences predicted contributions of muscle co-contraction and feedback control, we simulated reaching to a circular target in the presence of a divergent force field (Fig 3). Although co-contraction has been observed during reaching in a force field, the relative contribution of muscle co-contraction and sensorimotor feedback control to increased endpoint stiffness is not known [38,60,61]. We computed end-point stiffness as is done experimentally [62], i.e., by perturbing the hand in a specific direction in simulation and by dividing the resulting change in endpoint force by endpoint displacement at the end of a short-time interval (150ms).

Fig 3.

Fig 3

A: Unperturbed and perturbed reaching trajectories predicted in the presence (‘force field’) and absence (‘circle’) of a 200N/m divergent force field. Simulated end-point accuracy ellipses denote 95% confidence regions. B: Simulated muscle-level corrective actions that result from optimal feedback in response to unexpected extension perturbations. C: Horizontal and vertical end-effector stiffness throughout the reaching movement for the optimal controllers in the absence and presence of the divergent force field, and for the other reaching tasks (in absence of the force field). D: Average muscle co-contraction level for the uni-articular shoulder and elbow muscles and the bi-articular muscles in the presence and absence of the force field while reaching to the circle.

In agreement with experimental data [37,61], simulated reaching accuracies in a force field were similar to reaching accuracies in the absence of the force field (Fig 3A). Our multi-segment, muscle driven model predicted straighter reaching trajectories in a divergent force field than in the absence of a force field, which can be explained by the additional effort required to counteract the force field when not on a straight line (Fig 3A). Next, after a perturbation, corrections were performed more rapidly as the hand was moved sooner towards the nominal trajectory in the presence of a force field than in the absence of a force field. This strategy reduces effort compared to slower corrections in the presence of the force field.

Similar to the experiments performed by Burdet et al. [37] and Franklin et al. [38] our stochastic optimal control model predicted increased stiffness in the horizontal direction but not in the reaching direction in the presence of a horizontal divergent force field of 200 N/m (Fig 3C). The simulated orientation of the stiffness ellipses, with main axes tilted with respect to the horizontal and reaching directions, in both the presence and absence of a force field is in agreement with experimental observations [37,38] but differs from simulations based on a point-mass model, where the main axes of the stiffness ellipse are aligned with respect to the horizontal and reaching directions respectively [30].

In our model, the optimal strategy to increase stiffness in the presence of a divergent force field was to upregulate feedback control to all muscles without increasing co-contraction, while in experiments co-contraction increased significantly [38,58,60,61] (Fig 3B and 3D). The small increase in simulated co-contraction around the elbow in the divergent force field (Fig 3D) likely resulted from small changes in the mean kinematic trajectory and hence feedforward excitations when the force field was applied.

The simulated corrective activity of all muscles in response to perturbations was higher when reaching in a divergent force field than in a stable environment (Fig 3B). These predictions on corrective muscle activity in environments with different stability as well as perturbed reach trajectories in a divergent force field remain to be validated in future experiments.

Discussion

Our major contribution was to apply a recently developed efficient approximate stochastic optimal control framework to simulations of motion enabling us to predict, for the first time, realistic movement trajectories and muscle coordination patterns emerging from nonlinear musculoskeletal dynamics, feedforward, and feedback neural control in the presence of noise. We used a generally applicable method to approximate the true stochastic optimal control problems by deterministic optimal control problems, which could be solved efficiently. The use of direct collocation and gradient-based optimization to solve the approximate deterministic problems facilitated the use of constraints, allowing us to distinguish effort optimization and task-level goals, such as accuracy, rather than to trade them off in the cost function. The framework allowed us to considerably extend the predictions of stochastic optimal control as a theory of motor coordination to relative contributions of feedforward and feedback control of non-linear musculoskeletal systems. The use of nonlinear mechanical models that captured multiple joints and muscles enabled detailed comparison to experimental kinematic and EMG data. In particular, the addition of muscle models in stochastic optimal control simulations, demonstrated that muscle co-contraction can—in the presence of uncertainty in sensory information—minimize muscle effort required for a task. In balance control simulations, we demonstrated that benefits of increased impedance from muscle co-contraction depend on the intrinsic properties of the muscle and the interaction with delayed sensorimotor feedback mechanisms. Similarly, in reaching simulations, we demonstrated that multi-segment musculoskeletal dynamics are key to predicting complex perturbed and unperturbed reach trajectories under different task conditions, resulting from interactions between feedforward and feedback sensorimotor control mechanisms. Taken together, our results showed that complex features of motor control and movement that were not captured by previous optimal control simulations can emerge from nonlinear stochastic optimal control processes. As such, our approximate stochastic optimal control framework provides a valuable tool to providing insight in neuromechanics of normal and possibly impaired movement that incorporates the complexities of the neuromusculoskeletal system and the effects of physiological noise.

Implementing a direct collocation approach to the solution of nonlinear stochastic optimal control problems increased computational efficiency and modelling flexibility, allowing neuro-musculoskeletal model complexity, and thereby fidelity, to be increased. Most prior methods have only allowed efficient solutions of linear, but not nonlinear, stochastic optimal control problems [79,63], and were therefore not capable of predicting many important movement features. Stochastic optimal control simulations based on nonlinear dynamics have previously been performed using iterative Linear-Quadratic-Gaussian (iLQG) methods for reaching tasks [18]. However, such methods require a time-marching integration of the system dynamics under the current guess of the control law at each iteration to evaluate the nominal controls and state trajectories. Such shooting methods might be less suitable than collocation methods for movement tasks with less stable system dynamics, such as standing or walking. Further, the LQG approach assumes a quadratic cost function and a linear and continuously time-varying feedback control law, which might not always represent the biological system. Finally, LQG methods require constraints on the mean state and state covariance (e.g. limiting the standard deviation of the end-point position of the hand in a reaching task) to be imposed through penalty terms in the cost function, whose weights require tedious tuning. In contrast, the approach based on collocation and gradient-based optimization we used here allows imposing such constraints in a direct and intuitive manner. A few previous studies [64,65] have relied on direct collocation methods as well to solve stochastic optimal control problems but they evaluated the stochastic cost function and path constraints based on the simulation of a limited number of noisy episodes rather than describing the state distribution. For nonlinear systems with a limited number of degrees of freedom, such a sampling based approach appears tractable. However, with increasing degrees of freedom, the number of noisy episodes that is required to capture the underlying stochastic dynamics might become computationally intractable. Simulations based on too few episodes might result in problematic unstable and sub-optimal solutions. We overcame the limitations of prior methods by applying a recently proposed method to approximate stochastic optimal control problems by deterministic optimal control problems [26,27]. The proposed approach is based on the assumption that the state distribution can be approximated by a Gaussian distribution and that the propagation of the state covariance can be described by the Lyapunov equation [66]. This framework is applicable to optimal control problem formulations with any cost function design while feedback laws with any temporal and structural design are possible. Here, we used it to simultaneously solve for feedforward and feedback control laws.

Our novel framework allows task-goals to be specified as constraints allowing us to evaluate the minimal effort solution for a given task. It is still debated to what extent optimality assumptions capture human movement and what the optimality criteria underlying human movement are [67]. Previous stochastic optimal control simulations have typically been based on a multi-objective cost function reflecting a trade off in effort and accuracy [3,11,33]. In such simulations, the weights in the cost function must be tuned to produce simulations that match the desired accuracy. By using constraints to impose accuracy requirements and other task goals, task requirements can be separated from optimality principles that govern task execution within these requirements. We reproduced key features of movement kinematics and control by using expected effort as the sole performance criterion. Not only does the approximate stochastic optimal control framework remove the need of building multi-objective cost functions, which might require considerable user intuition, minimizing effort within the solution space allowed by the task requirements might also be more representative of how humans approach a task. Rather than minimizing variability, humans might pick a control strategy that is ‘good enough’ [68]. For example, in standing a ‘good enough’ control strategy prevents a fall and for reaching a ‘good enough’ control strategy brings the hand in the target. The framework, based on direct collocation, can efficiently handle constraints offering flexibility in formulating the optimal control problem by for example imposing control bounds and constraining kinematic variability throughout movement execution in agreement with task requirements. Our approach thus allowed us to further test effort minimization as the optimality criterion underlying human movement.

Our stochastic optimal control framework predicted co-contraction as a minimal effort strategy during perturbed standing balance, suggesting that a combination of co-contraction and feedback corrections is energetically more efficient than feedback corrections only. Our simulations suggest that muscle co-contraction might be a minimal effort strategy to achieve movement goals whereas it has often been seen as an energetically costly strategy to maximize accuracy. Co-contraction reduces effort when task performance benefits from increased joint impedance and activation-dependent muscle properties allow to increase intrinsic mechanical impedance at reasonable costs. This seems to be the case in healthy subjects for translational but not rotational perturbations and in vestibular loss subjects for both translational and rotational perturbations but only when muscle short-range stiffness was taken into account. Prior models also predicted feedback modulation with changing sensory acuity, but did not include muscles that allowed for co-contraction as a complementary strategy [15,45]. Modeling assumptions might have influenced the simulated levels of co-contraction in standing balance control. We used an all or nothing approach to model short-range-stiffness whereas short-range-stiffness disappears when muscle stretch exceeds a certain threshold [44]. Therefore, short-range-stiffness might only contribute during low amplitude sway up to 3° [35,44], and we might have overestimated its contribution when platform perturbations were high enough to induce large sway. Similarly, we might have overestimated contributions from co-contraction by modeling rigid tendon, especially when accounting for short-range stiffness (see also S1 Text).

Muscle co-contraction has been observed experimentally during reaching [69], especially in a divergent force field [58,60,61], but was not predicted by our stochastic optimal control simulations. Berret et al. only considered feedforward control in their simulations of reaching and found higher levels of co-contraction [30] as in the absence of feedback control, co-contraction is the only mechanism to increase joint impedance and hence robustness against perturbations during reaching. Our simulations suggest that muscle co-contraction is an energetically costly strategy when feedback is present resulting in very low simulated levels of co-contraction. Hence, both the simulations of Berret et al. [30] and our simulations are in conflict with the experimental observation of both co-contraction and feedback corrections during reaching. We might have failed to find co-contraction in our simulations because our framework does not allow predictions of the agonist-antagonist dual control strategy depending on muscle co-contraction [10]. By co-contracting antagonistic muscles, it is possible to increase agonist activity and decrease antagonist activity simultaneously yielding a more efficient response to a perturbation. Although our approach to stochastic optimal control accounts for nonlinear state dynamics, it was still based on a linear approximation around the mean state trajectory to propagate the state covariance matrix. An important nonlinearity arises from muscle activity being bound between zero and one and hence, reductions in muscle activity are only possible when muscle activity is larger than zero. Yet this nonlinearity is not reflected in our approximation of the state covariance dynamics, which resulted in predicted reductions in muscle excitations even when this would result in muscle excitations becoming smaller than zero. Hence, there was no need to increase baseline activity in our simulations to exploit the agonist-antagonist dual control strategy, which likely resulted in overly large contributions of this strategy as they came at no cost. To further improve the realism of our simulations, we thus need to more accurately describe the nonlinearities in the dynamics of the state covariance.

Several other modeling assumptions might have influenced the simulated levels of co-contraction. First, we did not model that co-contraction increases short-latency reflexes through gain scaling [70,71], thereby underestimating the contribution of co-contraction to reflexes. Second, due to the first order approximation to model feedback delays, sensory information–although attenuated—is available immediately. This might have increased the efficiency of feedback corrections, which in turn might have reduced the need for muscle co-contraction to increase joint impedance and provide instantaneous resistance against perturbations. Note, however, that the presence of sensorimotor noise prevented simulated feedback gains and therefore ‘early’ feedback contributions from becoming too large. Including an exact delay within the framework is important future work, and approximating delays by a low-pass filter must be done with caution when drawing conclusions. Third, feedback gains were unconstrained in how they could change as a function of time. Such infinite bandwidth is probably unrealistic and more realistic models of control dynamics might reduce the efficiency of feedback control and thereby increase simulated levels of co-contraction. Although the level of co-contraction might be sensitive to each of the above modeling assumptions, our simulations that simultaneously solved for optimal feedforward and feedback control contributions demonstrated that both muscle co-contraction and feedback gains varied as a function of the task, mechanical properties of the muscles, and sensory acuity.

A major contribution of our work was to create a framework for predicting how stochastic optimal control principles apply at the level of individual joints and muscles, which is difficult to impossible with currently available tools. Generating simulations that predict execution-level physiological variables that can be compared to experimental measures requires the subsystems, i.e. joints and muscles, from which those measures are obtained to be modeled. As such the ability of stochastic optimal control to predict movement cannot be accurately validated using simple models. While it is known that using different levels of detail to represent the musculoskeletal dynamics leads to different simulated responses to perturbations if motor commands remained unchanged [72], optimal control predictions can inform us on how the nervous system can exploit the nonlinear dynamics of the musculoskeletal system. For example, our two-segment, muscle-driven arm model yielded more realistic reach trajectories when reaching to different targets, when compared to prior predictions based on a point mass [8]. We further explored the sensitivity of predicted reach trajectories to the presence of bi-articular muscles and muscle properties (see S1 Text for details). Interestingly, simulated reach trajectories were robust against the removal of bi-articular muscles but sensitive to how mono-articular muscles were modeled. Using an alternative, physiologically plausible set of muscle properties yielded perturbed reach trajectories in the ‘bar’ condition that captured the experimentally observed overshoot. However, the overshoot was also predicted for perturbed reach trajectories in the ‘circle’ condition, where it was not observed experimentally. These exploratory results demonstrate both the potential of our framework to explore the effect of musculoskeletal properties on (perturbed) movement and the importance of accurate musculoskeletal models.

Simulations of endpoint stiffness in an unstable force field show, similar to experimental observations [58,60], that stiffness is modulated depending on the environment and that the orientation of the main axes of the stiffness ellipses are tilted with respect to the reaching direction as a result of musculoskeletal dynamics. Berret et al. [31] combined a similar musculoskeletal model with a different control strategy, i.e. feedforward control in the absence of feedback control, and found a similar tilt of the stiffness ellipses, suggesting that musculoskeletal dynamics is a determining factor. These examples demonstrate how stochastic optimal control applied to muscle-driven models can be used to interpret complex motor coordination at the muscle level and to study control of redundant sets of muscles in the presence of noise.

Leveraging computational advances from deterministic movement simulations to enable predictions for stochastic movement simulations may yield much insight into both normal and pathological movement control, as well as human-robot interactions. Under the assumption of stochastic optimal control, our movement predictions became more realistic when modelling neuro-musculoskeletal dynamics in more detail. Moving beyond deterministic simulations may enable coupled changes in complex, whole body movement and its neural control to be studied in both healthy and impaired movement. It is critical to consider interactions of feedforward and feedback neural control mechanisms with musculoskeletal mechanics in a highly redundant space of feasible neuromusculoskeletal solutions for movement, particularly in the presence of noise. Optimal control processes may also explain compensatory changes in neural control of movement in impaired motor control [73]. Moreover, the role of nonlinear musculoskeletal dynamics has been recently demonstrated to play a significant role in impaired motor control such as in spasticity [6]. Further, movement variability might complicate the control and design of exoskeletons and other assistive devices. Incorporating uncertainty and movement variability in simulations that are often used to generate and test ideas before implementing these in reality, could facilitate the design process of assistive devices.

Methods

Approximating the stochastic optimal control problem by a deterministic optimal control problem

We formulate simulations of movement in the presence of noise as stochastic optimal control problems. The system dynamics are stochastic: x˙(t)=f(x(t),e(t),wsys); with x the state trajectories, e the control trajectories and wsys a set of stochastic disturbances with zero-mean Gaussian distribution (noise) acting on the system. The control trajectories e(t) are determined by the control policy:

e(t)=eff(t)+K(t)*yfb(x(t),wpolicy),

which consists of a deterministic feedforward term eff(t) and a linear feedback term with the deterministic feedback matrix K(t) that is multiplied with the feedback signal yfb(x(t), wpolicy). wpolicy is a set of stochastic disturbances, with a zero-mean Gaussian distribution, acting within the control policy. Note that when K(t) is non-zero, the control trajectories e(t) are stochastic, and depend on the state.

To solve the stochastic optimal control problems, we approximate the stochastic state trajectories, which are in general non-normally distributed, by normally distributed trajectories. As a result, we can describe the stochastic state trajectory by the mean state trajectory xmean(t) and state covariance trajectory P(t). The dynamics of the mean state can be described by a deterministic approximation of the stochastic dynamics obtained by setting the disturbances to their mean value, which is zero (w = [wsys, wpolicy] = 0):

x˙mean(t)=f(xmean(t),e(t),w=0). (1)

The dynamics of the state covariance can be described by the continuous Lyapunov differential equations based on a local first-order approximation of the nonlinear system dynamics around the mean state, corresponding to the propagation rules used in the extended Kalman Filter ([29,74]):

P˙(t)=A(t)P(t)+P(t)A(t)T+C(t)ΣwC(t)T (2)
A(t)=(fx(t,x(t),e(t),w))x(t)=xmean(t) (3)
C(t)=(fw(t,x(t),e(t),w))x(t)=xmean(t) (4)

with Σw the continuous time covariance matrix describing the noise sources. Eqs (14) form a deterministic approximation of the stochastic dynamics. Similarly, we can approximate the stochastic constraint functions g(x(t)) by a normal distribution with mean μg(x) = g(xmean), and standard deviation σg(x)=gxP(t)gxT. Using this approach, we can transform the stochastic optimal control problem into an approximate deterministic optimal control problem:

mineff(t),K(t)tstarttfinalJcost(xmean(t),eff(t),K(t),P(t))dtcost (5)
subjecttox˙mean(t)=f(xmean(t),e(t),w=0)meanstatedynamics (6)
P˙(t)=A(t)P(t)+P(t)A(t)T+C(t)ΣwC(t)Tcovariancedynamics (7)
gi(xmean(t),u(t))+γigixP(t)giTx0i=1,ngpathconstraints (8)

with γi a parameter determining the probability that the state trajectory fulfills the constraint and ng the number of path constraints. Given the Gaussian approximation of the state distribution, γi would need to be infinitely large to impose the constraint over the whole distribution. In practice, we choose finite values for γi, where the chance of fulfilling the constraints is 95% when γi = 2, and 99.7% when γi = 3.

We solve the approximate deterministic optimal control problems using direct collocation with a trapezoidal integration scheme and mesh intervals of 10ms and solve the resulting large, but sparse nonlinear programming problems (NLP) with IPOPT [75]. We formulate all dynamics implicitly (details in S1 Text) to improve the numerical condition of the NLP [6,23,76]. We use CasADi [77] to perform automatic differentiation, which improves the accuracy of the derivative computations and might reduce the number of operations to compute gradients through use of its reverse mode [24]. For details on the numerical implementation of the optimal control problems, we refer to S1 Text. The code for simulating the presented models is publicly available at: https://github.com/tomvanwouwe1992/SOC_Paper.

Stochastic optimal control simulations of movement

We applied our stochastic optimal control framework to two fundamental movements that have been studied extensively: standing balance and goal-directed reaching. An overview of the musculoskeletal and motor control models is provided in Figs 1 and 2. In general, we solved for both feedforward (i.e., open-loop) and feedback components (i.e. feedback gains) of the control law to perform the prescribed movement task robustly with minimal expected effort. For standing balance, we imposed task robustness by requiring the solution to be marginally stable, i.e. the state covariance was constant. For reaching, we imposed the required accuracy of the reaching movement depending on the target shape.

Standing balance

We first give a general description of the optimal control problem, simulations and outcome measures. Dynamic equations and model parameters are described in more detail below.

General description of standing balance simulations and outcome measures

We modeled standing in the presence of platform perturbations based on an inverted pendulum (IP) model (mass: 70kg, length: 1m) linked to a rotatable and translational platform (Fig 1). Two antagonistic Hill-type muscles with rigid tendons actuated the ankle joint, i.e., the joint connecting the pendulum to the platform. The muscle properties (maximal isometric force FISO, tendon slack length lTs, optimal fiber length lMo, optimal pennation angle α, damping coefficient β; described in Table 1) were taken from the soleus and tibialis anterior muscle of the OpenSim3.3 gait10dof18musc model [78]. Input to the Hill-type muscles were muscle activations (aSOL, aTA). Activations resulted from excitations through first-order dynamics with a time constant (τ) of 150ms lumping together sensory and motor delays [79]. These muscle excitations resulted from excitations that were composed of feedforward (eSOL,ff, eTA,ff) and feedback excitations (eSOL,fb, eTA,fb = K*yfb). The feedback excitations were a linear combination of the angle and angular velocity of the inverted pendulum with respect to the gravitational field, representing vestibular information, and of the angle and angular velocity of the inverted pendulum with respect to the platform, representing proprioceptive information. All feedback gains were constant in time. The policy noise (wpolicy) consisted of Gaussian vestibular (wv,q,wv,q˙) and proprioceptive noise (wp,q,wp,q˙) with respective variance σv,q2,σv,q˙2 and σp,q2,σp,q˙2 that was added to both the vestibular and proprioceptive cues. Motor noise, which was part of the system noise (wsys), consisted of additive motor noise (wSOL, wTA) with variance (σSOL2,σTA2) that corrupted the muscle activations. All sensory and motor noise sources were independent and values for the noise source variances are provided in Table 2.

Table 1. Muscle properties of model for perturbed standing simulations.

Muscle properties
soleus tibialis anterior
FISO[N] 5137 3000
lTs[m] 0.2514 0.2228
lMo[m] 0.0528 0.1028
α [rad] 0.4364 0.0873
β 0.01 0.01

Table 2. Noise characteristics for perturbed standing balance simulations.

Sensory noise Motor noise σSS,q˙2[(°s)2/Hz] 0.0012; 0.32; 0.62; 1.22; 2.42; 4.82; 9.62
σp,q2[(°)2/Hz] 0.12 σSOL2[()2./Hz] 0.012
σp,q˙2[(°s)2/Hz] 0.22 σTA2[()2/Hz] 0.012 Platform translations
σv,q2[(°)2/Hz] 0.32 Platform rotations σSS,trans2[(ms2)2/Hz] 0.0012; 0.01752; 0.0352; 0.072; 0.142; 0.282; 0.562
σv,q˙2[(°s)2/Hz] 0.62 σSS,q2[(°)2/Hz] 0.0012; 0.1252; 0.252; 0.52; 12; 22; 42

We modeled random rotation and translation perturbations through adding additional sources of system noise (wsys). The random platform angular position and velocity were modeled as zero mean Gaussian noise (wSS,q,wSS,q˙) with constant variance (σSS,q2,σSS,q˙2). We introduce the subscript ‘ss’ to indicate the variables related to the platform or support-surface and to avoid confusion with subscript ‘p’ for proprioception. To simulate random translation perturbations, we only needed to describe the platform translational acceleration as zero mean Gaussian noise (wSS,trans) with variance (σSS,trans2). We could ignore the random translational position and velocity of the platform as these affect neither the dynamics of the pendulum nor the proprioceptive and vestibular information (relative and absolute pendulum angles). Values for the noise source variances are provided in Table 2.

The task-goal during standing was to maintain a stable upright posture (marginal stability). Postural marginal stability, in the presence of noise, was modeled by constraining the mean angle of the pendulum to be upright with zero angular velocity, the mean state derivatives to be zero (mean pendulum position and velocity and muscle activations are constant in time), and the state covariance derivatives to be zero: P˙=0 (state covariance matrix was constant in time). We thus solved for a single state, rather than a state trajectory, and the corresponding control policy to maintain a stable upright posture in the presence of noise. We solved for the constant feedforward muscle excitations and feedback gains given the described constraints while minimizing expected effort. Expected effort was modeled as the expected value of the sum of muscle excitations squared:

E[(eSOL,ff+eSOL,fb)2+(eTA,ff+eTA,fb)2] (9)

which is equivalent to

eSOL,ff2+eTA,ff2+Var[eSOL,fb]+Var[eTA,fb] (10)

since the deterministic feedforward terms are uncorrelated to the stochastic feedback terms and because the stochastic feedback terms (eSOL,fb, eTA,fb) have an expected value of zero (a derivation can be found in the S1 Text).

We performed simulations based on four different models. We performed simulations with the full feedback model, representing healthy subjects, and simulations with only proprioceptive feedback, representing vestibular loss subjects. We performed these simulations based on two muscle models, a Hill-type muscle model and a Hill-type model that was extended with short-range-stiffness (SRS). Short-range stiffness was modeled by adding a spring with activation dependent stiffness in parallel to the contractile element producing a force

FSRS=kSRSFiSo,maxa(l˜ml˜m,meanl˜m,opt), (11)

with a the muscle baseline activation, l˜m the muscle fiber length, l˜m,mean the muscle fiber length in mean (upright) position, l˜m,opt the optimal muscle fiber length and kSRS the short-range-stiffness scaling factor, which was set to 1 [34]. We thus have four models: healthy (‘healthy’), healthy with muscles including short-range-stiffness (‘healthy—SRS’), vestibular loss (‘VL’), vestibular loss with muscles including short-range-stiffness (‘VL—SRS’).

Our outcome measures were (1) body sway–described by the standard deviation of the normally distributed pendulum angle, (2) the relative contribution of proprioceptive feedback, KproprioKproprio+Kvest, (3) the co-contraction index (CCI), computed as in [47]: min(aSOL,aTA)max(aSOL,aTA)*(aSOL+aTA), and (4) the contribution of expected effort from feedback to the total expected effort: effortFBeffortFB+effortFF.

Stochastic dynamics and model parameters

We indicated variables that are modeled as Gaussian noise in red. The state consisted of the segment angle and angular velocity, q,q˙, the activation of soleus and tibialis anterior, aSOL and aTA and the platform angle and angular velocity, qSS and q˙SS:

x=[qq˙aSOLaTAqSSq˙SS] (12)

The control law was parametrized by the baseline muscle excitations, eff, and the constant feedback gains, K:

eff=[eSOL,ffeTA,ff];K=[Kq,propSOLKq˙,propSOLKq,vestSOLKq˙,vestSOLKq,propTAKq˙,propTAKq,vestTAKq˙,vestTA], (13)

where SOL refers to the soleus and TA refers to the tibialis anterior. The dynamics were described by the equations of motion of the pendulum and the first order delay between excitation and activation. The equations of motion were expressed in a non-inertial reference frame by introducing fictitious forces due to the translational platform acceleration:

[dq/dtdq˙/dtdaSOL,fb/dtdaTA,fb/dtdqSS/dtdq˙SS/dt]=[q˙mgl;sin(q)ml2+I+TSOL+TAml2+I+ml;sin(q)ml2+IwSS,trans(eSOL,fb+eSOL,ffaSOL)/τ(eTA,fb+eTA,ffaTA)/τ00] (14)

with m the pendulum mass, l the pendulum length, g the gravity constant, I the pendulum inertia, TSOL+TA the torque generated by the soleus and tibialis anterior. Note that the platform has a constant velocity dqSSdt=0 and acceleration dq˙SSdt=0, meaning that the mean platform position and velocity and the platform position and velocity covariance matrix are per definition constant in time and equal to their chosen initial value. The ankle angle and angular velocity result from the difference between the pose of the segment and the platform:

qA=qqSS,q;q˙A=q˙q˙SS,q˙

The feedback muscle excitations (eSOL,fb, eTA,fb) resulted from linear feedback of the feedback signal yfb that consists of a proprioceptive, and vestibular (‘v’) signal:

yfb=[qA+wp,qq˙A+wp,q˙q+wv,qq˙+wv,q˙];[eSOL,fbeTA,fb]=K;yfb (15)

Note that we assumed here that the platform’s mean linear and angular position, velocity and acceleration were constant in time with constant variance. This differs from driving the platform acceleration with a zero-mean Gaussian input signal, which would lead to a monotonic increase of the velocity and position variance in time. Instead, we assumed that a more clever platform controller was used.

The torque generated by the soleus and tibialis anterior muscles is a function of their forces and moment arms:

TSOL+TA=FSOLdSOL(qA)+FTAdTA(qA) (16)

with dSOL(qA), dTA(qA) the soleus and tibialis anterior moment arms, which depend on the ankle angle qA. Muscle forces depend on muscle activation and through the force-length-velocity properties of the muscle also on the muscle length (lSOL, lTA) and velocity (l˙SOL,l˙TA), which in turn are a function of the ankle angular position and velocity qA,q˙A:

[FSOLFTA]=[FISO,SOL[(aSOL,fb+aSOL,base+wSOL)fl(lSOL)fv(lSOL,l˙SOL)+fp,SOL(lSOL)]FISO,TA[(aTA,fb+aTA,base+wTA)fl(lTA)fv(lTA,l˙TA)+fp,TA(lTA)]] (17)

with fl the active muscle force-length relationship, fv the muscle force-velocity relationship, and fp the passive muscle force-length relationship. The active force-length, force-velocity, and passive force-length relationships are described in [22].

Muscle-tendon lengths were approximated by the sum of a linear function, a sine, and a constant offset (lMT = aq+b∙sin(cq)+d) with a, b, c and d estimated by minimizing the least square error between this approximation and the muscle lengths obtained from the OpenSim gait10dof18musc model [75]. The moment-arms are computed as the derivatives of the muscle-tendon lengths with respect to the angle: dmuscle = a+bc∙cos(cq) [78].

Noise characteristics (summarized in Table 2) were based on preliminary simulations and experimental data [36]. Motor noise, added to muscle activations, had a standard deviation of 1% of the maximal signal based on force fluctuation measurements in different isometric tasks [80,81], where a coefficient of variation between 1–5% was found. These measurements quantify motor noise indirectly as force-tracking errors might have other origins as well and we therefore selected the lower end of the measured variability to model motor noise. The relative values of proprioceptive and vestibular noise were selected such that the relative contribution of proprioceptive feedback, KpropKprop+Kvest, was between 0.7–0.8 during optimal unperturbed standing in agreement with values identified from experiments by Peterka [32]. The absolute values for sensory noise were selected such that during optimal unperturbed standing body sway, defined as the standard deviation of the pendulum angle, was ~0.3°, a typical value found in experiments of quiet standing in healthy subjects [78]. The values that determine the platform rotations, σSS,q2,σSS,q˙2, were selected to mimic the rotational perturbations applied in [36]. In these experiments, the platform angular position, velocity and acceleration were not Gaussian. Here, we approximated the non-Gaussian experimental platform movements by zero-mean Gaussian platform movements with a standard deviation of half the amplitude of the experimental perturbations. The variance of the translational accelerations σSS,trans2 were determined such that the healthy model for the maximal accelerations, under optimal control, reached a standard deviation of the ankle angle of 4°, a value that is typically not exceeded in continuous translation perturbation experiments [82,83].

The time unit appears in the noise variance ([/Hz]) or ([.s]) to describe the power spectral density of a continuous-time Gaussian noise. If we perform a numerical integration and thus move to a discrete-time description of continuous noise the unit of time disappears by dividing the variance by the integration interval length (expressed in seconds). This makes sense if we reflect about a forward integration of a 1D point mass where the velocity has a continuous variance of e.g. 2 (m/s)2/Hz. If we perform the numerical integration over an interval of 1s the discrete variance (Σ) is 2 (m/s)2, we find that the variance of the position after 1s is equal to 2m2: Pk+1=Pk+dt*Σ*dtPk+1=0+1[s]*2[(m/s)2]*1[s]=2m2. If we perform the numerical integration over 1s using a time step of 0.1s the discrete variance is 20 (m/s)2. We obtain P0.1s=0+0.1[s]*20[(m/s)2]*0.1[s]=0.2m2. We obtain for P0.2s=0.2[m2]+0.1[s]*20[(m/s)2]*0.1[s]=0.4m2. Finally, P1s = 2m2.

Goal-directed reaching

General description of reach simulations and outcome measures

We modeled four reaching tasks based on a two-segment model, where the segments represent the upper and lower arm and are connected by hinge joints (Fig 2). The arm model was driven by six Hill-type muscles with rigid tendons consisting of a uni-articular agonist-antagonist pair for the shoulder (anterior and posterior deltoid) and the elbow (brachialis and lateral triceps) and a bi-articular agonist-antagonist pair spanning both the elbow and the shoulder (biceps and long triceps).The muscle properties (maximal isometric force FISO, normalized maximal contraction velocity vmax, and damping coefficient β; described in Table 3) were identical to the model used and reported in [20] and are reported in Table 3. The muscles were stimulated by muscle excitations (eBRACH, eLATTRI, eANTDEL, ePOSTDEL, eBIC, eLONGTRI) that were a sum of feedforward (eff) and feedback excitations (efb = K*yfb). Feedback excitations consisted of linear time-varying feedback of the end effector (hand) position and velocity with respect to the nominal end effector kinematics. The nominal end effector kinematics was the end effector kinematics due to the feedforward excitations in the absence of sensory and motor noise. Activations were related to excitations through first-order dynamics with a time constant (τ) of 150ms lumping together sensorimotor delays. Policy or sensory noise was modeled by additive Gaussian noise on the end effector position (wEEx,wEEy) and velocity (wEE˙x,wEE˙y) with respective variance (σEE,x2,σEE,y2,σEE,x˙2,σEE,y˙2). The noisy end effector positions and velocities were input to the feedback law. System noise consisted of motor noise that was modeled by Gaussian additive noise added to each joint torque (ws, we) with variance (σs2,σe2).

Table 3. Musculoskeletal properties of model for goal-directed reaching.
Skeletal properties Muscle properties
Upper arm Forearm BRACH LATTRI ANTDEL POSTDEL BIC LONGTRI
m [kg] 1.4 1.0 FISO[N] 572 445 700 382 159 318
l [m] 0.3 0.33 v˜M,max 10 10 10 10 10 10
I[kg.m2] 0.025 0.045 β 0.01 0.01 0.01 0.01 0.01 0.01

The task-goal was to perform a point-to-point reaching movement over a distance of 25cm in 0.8s with a pre-defined chance of the end effector ending up within the target. Reaching accuracy was imposed by limiting the variance of the horizontal and vertical end effector position depending on the task requirements.

  1. Reaching towards a small circular target (circle) was modeled by constraining the standard deviation of the end effector end-point horizontal and vertical positions to be smaller than 0.4cm.

  2. Reaching towards a horizontal bar (bar) was modeled by constraining the standard deviation of the end effector end-point vertical position to be smaller than 0.4cm

  3. Reaching towards a circular target in the presence of an obstacle was modeled by imposing the standard deviation of the end effector end-point horizontal and vertical positions to be smaller than 0.4cm and the standard deviation of end effector horizontal position during the second part (>0.25s) of the reaching trajectory to be smaller than 0.4cm

  4. Reaching towards a circular target in the presence of a divergent force field of 200N/m was modeled by constraining the standard deviation of the end effector end-point horizontal and vertical positions to be smaller than 0.4cm.

We solved for optimal control policies that minimized expected effort,

E[[eBRACH(t)2+eLATTRI(t)2+eANTDEL(t)2+ePOSTDEL(t)2+eBIC(t)2+eLONGTRI(t)2]dt], (18)

while fulfilling the task requirements for each of the specific tasks. We then used these optimal control policies to perform 100 forward simulations of unperturbed and perturbed reaching. The simulated extension perturbations matched the perturbations in the experiments described by Nashed et al. [8] and are shown in Fig 2C.

We computed the accuracy for the different optimal control policies in perturbed and unperturbed reaching by computing the 95% confidence ellipses of the end-point positions of the end effector for the 100 simulations. We computed the corrective muscle activations for perturbed reaching by subtracting the mean muscle activations during unperturbed reaching from the muscle activations for each of the 100 perturbed reaching simulations. We computed the mean and standard deviation of these corrective muscle activations to analyze corrective behavior at the muscle level. We computed the co-contraction index throughout the reaching movement for each of the joints by averaging the joint-specific CCI over time. We computed the CCI as in [47]: min(aflex(t),aext(t))max(aflex(t),aext(t))*(aflex(t)+aext(t)) where aflex and aext are the activations of the flexor and extensor muscles of the antagonistic pairs.

Stochastic dynamics and model parameters

We indicated variables that are described as constant zero-mean Gaussian noise in red. The state consisted of the joint angular positions (qs, qe) and velocities (q˙s,q˙e), and the activations of the anterior deltoid, the posterior deltoid, the biceps and the triceps muscles (aBRACH, aLATTRI, aANTDEL, aPOSTDEL, aBIC, aLONGTRI).

x=[qsqeq˙sq˙eaBRACHaLATTRIaANTDELaPOSTDELaBICaLATTRI] (19)

The control law was parametrized by the feedforward muscle excitation trajectories eff(t) and the time-varying feedback gains, K(t):

eff(t)=[eff,BRACH(t)eff,LATTRI(t)eff,ANTDEL(t)eff,POSTDEL(t)eff,BIC(t)eff,LONGTRI(t)];K(t)=[KqsBRACH(t)KqeBRACH(t)Kq˙sBRACH(t)Kq˙eBRACH(t)KqsLATTRI(t)KqeLATTRI(t)Kq˙sLATTRI(t)Kq˙eLATTRI(t)KqsANTDEL(t)KqeANTDEL(t)Kq˙sANTDEL(t)Kq˙eANTDEL(t)KqsPOSTDEL(t)KqePOSTDEL(t)Kq˙sPOSTDEL(t)Kq˙ePOSTDEL(t)KqsBIC(t)KqeBIC(t)Kq˙sBIC(t)Kq˙eBIC(t)KqsLONGTRI(t)KqeLONGTRI(t)Kq˙sLONGTRI(t)Kq˙eLONGTRI(t)] (20)

The dynamics were described by the equations of motion and the first order delay between excitations and activations:

[dqdtdq˙dtdadt]=[q˙M(q)1(C(q,q˙)+TM+[WSwe])(ea)/τ] (21)

with M(q) the mass-matrix of the arm model, C(q,q˙) the term describing the Coriolis forces, TM the shoulder and elbow joint torques generated by the muscles and ws, we the stochastic torque acting at shoulder and elbow. The total muscle excitations were the result of feedforward and feedback control, where the feedback signal is the noisy (wEEx,wEEy,wEE˙x,wEE˙y) deviation from the end effector position and velocity with respect to the expected or mean end effector positions and velocity:

yfb=[EEx+wEExEEy+wEEyEE˙x+wEE˙xEE˙y+wEE˙y]EEref;e=eff+Kyfb. (22)

The end effector positions (EEx, EEy) and velocities (EE˙x,EE˙y) can be computed from the joint positions and velocities:

[EExEEyEE˙xEE˙y]=fkin(q,q˙) (23)

The reference end effector trajectory (EEref) is the end effector trajectory in the absence of noise, which can thus be computed from the mean joint trajectories.

EEref=fkin(qmean,q˙mean)

The shoulder and elbow torques generated by the different muscles depends on muscle forces and moment arms, which in turn depend on the shoulder and elbow joint angles:

TM=[FANTDEL;dANTDEL(qs)+FPOSTDELdPOSTDEL(qs)+FBICdBIC,s(qs)+FLONGTRIdLONGTRI,s(qs)FBIRACH;dBRACH(qe)+FLATTRIdLATTRI(qe)+FBICdBIC,e(qe)+FLONGTRIdLONGTRI,e(qe)], (24)

with dANTDEL(qs),dPOSTDEL(qs),dBRACH(qe),dLATTRI(qe),dLONGTRI,s(qs),dLONGTRI,e(qe),dBIC,e(qe),dBIC,s(qs) the muscle moment arms depending on the articulated joint angles. Muscle forces depend on muscle activation and through the force-length-velocity properties of the muscle also on the normalized muscle fiber length (lBRACH, lLATTRI, lANTDEL, lPOSTDEL, lBIC, lLONGTRI) and velocity (l˙BRACH,l˙LATTRI,l˙ANTDEL,l˙POSTDEL,l˙BIC,l˙LONGTRI), which in turn were a function of skeleton positions and velocities (qs,qe,q˙s,q˙e):

[FBRACHFLATTRIFANTDELFPOSTDELFBICFLONGTRI]=[FISO,BRACH[aBRACHfl(lBRACH)fv(lBRACH,l˙BRACH)+fp(lBRACH)]FISO,LATTRI[aLATTRIfl(lLATTRI)fv(lLATTRI,l˙LATTRI)+fp(lLATTRI)]FISO,ANTDEL[aANTDELfl(lANTDEL)fv(lANTDEL,l˙ANTDEL)+fp(lANTDEL)]FISO,POSTDEL[aPOSTDELfl(lPOSTDEL)fv(lPOSTDEL,l˙POSTDEL)+fp(lPOSTDEL)]FISO,BIC[aBICfl(lBIC)fv(lBIC,l˙BIC)+fp(lBIC)]FISO,LONGTRI[aLONGTRIfl(lLONGTRI)fv(lLONGTRI,l˙LONGTRI)+fp(lLONGTRI)]] (25)

with fl the active muscle force-length relationship, fv the muscle force-velocity relationship, and fp the passive muscle force-length relationship. The active force-length, force-velocity, and passive force-length relationships are described in [22].

Normalized muscle fiber lengths were approximated by the sum of linear functions and sines of the joint angles added to a constant offset: (lM=asqs+bscssin(csqs)+aeqe+becesin(ceqe)+d). The muscle-moment arms were computed as scaled versions of the derivatives of the normalized fiber lengths with respect to the joint angle: ds=ss(as+bssin(csqs)); de=se(ae+besin(ceqe)), where the scaling factors serve to denormalize the muscle-moment arms. The coefficients parametrizing the muscle-tendon lengths and moment arms (as, bs, cs, ss, ae, be, ce, se, and d) were estimated to match the data provided in [20].

We computed the variance of the end effector position and velocity in the horizontal and vertical directions, used to define accuracy constraints, based on the following equations:

Var(EEx(t),EEy(t),EE˙x(t),EE˙y(t))=trace
((fkin(q(t),q(t)˙)x)qmean(t),q˙mean(t)P(t)(fkin(q(t),q˙(t))x)qmean(t),q˙mean(t)) (26)

with P(t) the covariance matrix used to approximate the stochastic state.

Noise characteristics summarized in Table 4 are based on preliminary simulations and experimental data. Motor noise was defined as a noisy torque actuator with zero mean and a variance power spectral density of 0.05(Nm)2/Hz. The absolute values for sensory noise were selected such that during optimal unperturbed reaching to the circle target an end-point accuracy of 0.4cm was achievable but an accuracy of 0.2cm was not [8].

Table 4. Noise characteristics for goal-directed reaching simulations.

Sensory noise Motor noise
σEE,x2[(mm)2/Hz] 0.62 σSH2[(Nm)2/Hz] 0.052
σEE,y2[(mm)2/Hz] 0.62 σEL2[(Nm)2/Hz] 0.052
σEE,x˙2[(mms)2./Hz] 4.82
σEE,y˙2[(mms)2./Hz] 4.82

Supporting information

S1 Text. Describes additional results, motivation and consequences on assuming rigid tendons and more details on the implemented method.

(PDF)

Data Availability

No original experimental data was produced. Code is available on GitHub at https://github.com/tomvanwouwe1992/SOC_Paper Code runs in MATLAB R2020a.

Funding Statement

We gratefully acknowledge FWO (Research Foundation, Flanders: https://www.fwo.be/en/) for the fellowships FWO 1S82320N funding TVW and G079216N funding FDG that allowed to perform this research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Umberger BR, Miller RH. Optimal control modeling of human movement. Handb Hum Motion. 2018;1–3: 327–348. doi: 10.1007/978-3-319-14418-4_177 [DOI] [Google Scholar]
  • 2.Dembia CL, Bianco NA, Falisse A, Hicks JL, Delp SL. OpenSim Moco: Musculoskeletal optimal control. doi: 10.1101/839381 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Todorov E, Jordan MI. Optimal feedback control as a theory of motor coordination. 2002. doi: 10.1038/nn963 [DOI] [PubMed] [Google Scholar]
  • 4.Ackermann M, van den Bogert AJ. Optimality principles for model-based prediction of human gait. J Biomech. 2010;43: 1055–1060. doi: 10.1016/j.jbiomech.2009.12.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Groote F De, Falisse A. Perspective on musculoskeletal modelling and predictive simulations of human movement to assess the neuromechanics of gait. Proceedings Biol Sci. 2021;288: 20202432. doi: 10.1098/rspb.2020.2432 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Falisse A, Serrancolí G, Dembia CL, Gillis J, Jonkers I, De Groote F. Rapid predictive simulations with complex musculoskeletal models suggest that diverse healthy and pathological human gaits can emerge from similar control strategies. J R Soc Interface. 2019;16. doi: 10.1098/rsif.2019.0402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Crevecoeur F, Scott SH. Beyond Muscles Stiffness: Importance of State-Estimation to Account for Very Fast Motor Corrections. Valero-Cuevas FJ, editor. PLoS Comput Biol. 2014;10: e1003869. doi: 10.1371/journal.pcbi.1003869 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Nashed JY, Crevecoeur F, Scott SH. Influence of the behavioral goal and environmental obstacles on rapid feedback responses. J Neurophysiol. 2012;108: 999–1009. doi: 10.1152/jn.01089.2011 [DOI] [PubMed] [Google Scholar]
  • 9.Harris CM. On the optimal control of behaviour: a stochastic perspective. J Neurosci Methods. 1998;83: 73–88. doi: 10.1016/s0165-0270(98)00063-6 [DOI] [PubMed] [Google Scholar]
  • 10.Crevecoeur F, Scott SH, Cluff T. Robust Control in Human Reaching Movements: A Model-Free Strategy to Compensate for Unpredictable Disturbances. J Neurosci. 2019;39: 8135–8148. doi: 10.1523/JNEUROSCI.0770-19.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Harris C, Wolpert D. Signal-dependent Noise determines Motor Planning. Nature. 1998;394: 780–784. Available: http://wexler.free.fr/library/files/harris. doi: 10.1038/29528 [DOI] [PubMed] [Google Scholar]
  • 12.Scott SH. The computational and neural basis of voluntary motor control and planning. Trends in Cognitive Sciences. Elsevier; 2012. pp. 541–549. doi: 10.1016/j.tics.2012.09.008 [DOI] [PubMed] [Google Scholar]
  • 13.Todorov E. Optimality principles in sensorimotor control. Nat Neurosci. 2004;7: 907–915. doi: 10.1038/nn1309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kuo AD. The six determinants of gait and the inverted pendulum analogy: A dynamic walking perspective. Hum Mov Sci. 2007;26: 617–656. doi: 10.1016/j.humov.2007.04.003 [DOI] [PubMed] [Google Scholar]
  • 15.Kuo AD. An optimal state estimation model of sensory integration in human postural balance. J Neural Eng. 2005;2: 235–249. doi: 10.1088/1741-2560/2/3/S07 [DOI] [PubMed] [Google Scholar]
  • 16.Diedrichsen J. Optimal Task-Dependent Changes of Bimanual Feedback Control and Adaptation. Curr Biol. 2007;17: 1675–1679. doi: 10.1016/j.cub.2007.08.051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Diedrichsen J, Shadmehr R, Ivry RB. The coordination of movement: optimal feedback control and beyond. Trends in Cognitive Sciences. Elsevier Current Trends; 2010. pp. 31–39. doi: 10.1016/j.tics.2009.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Todorov E, Li W. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. Available: www.cogsci.ucsd.edu/~todorov. [Google Scholar]
  • 19.Li W, Todorovz E. Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic system. Int J Control. 2007;80: 1439–1453. doi: 10.1080/00207170701364913 [DOI] [Google Scholar]
  • 20.Li W, Todorov E. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems. Iterative Linear Quadratic Regulator Design for Nonlinear Biological Movement Systems. 2004. Available: https://www.researchgate.net/publication/220271553. [Google Scholar]
  • 21.Miyamoto H, Nakano E, Wolpert DM, Kawato M. TOPS (Task Optimization in the Presence of Signal-Dependent Noise) Model. 2004;35: 48–58. doi: 10.1002/scj.10377 [DOI] [Google Scholar]
  • 22.Groote F De, Kinney AL, Rao A V, Fregly BJ. Evaluation of Direct Collocation Optimal Control Problem Formulations for Solving the Muscle Redundancy Problem. Ann Biomed Eng. 2016;44: 2922–2936. doi: 10.1007/s10439-016-1591-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Bogert AJ Van Den, Blana D, Heinrich D. Implicit methods for efficient musculoskeletal simulation and optimal control. Procedia IUTAM. Elsevier B.V.; 2011. pp. 297–316. doi: 10.1016/j.piutam.2011.04.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Falisse A, Serrancolí G, Dembia CL, Gillis J, Groote F De. Algorithmic differentiation improves the computational efficiency of OpenSim-based trajectory optimization of human movement. Srinivasan M, editor. PLoS One. 2019;14: e0217730. doi: 10.1371/journal.pone.0217730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ong CF, Geijtenbeek T, Hicks JL, Delp SL. Predicting gait adaptations due to ankle plantarflexor muscle weakness and contracture using physics-based musculoskeletal simulations. PLoS Comput Biol. 2019;15: e1006993. doi: 10.1371/journal.pcbi.1006993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Houska B, Diehl M. Robustness and stability optimization of power generating kite systems in a periodic pumping mode. Proceedings of the IEEE International Conference on Control Applications. 2010. pp. 2172–2177. doi: 10.1109/CCA.2010.5611288 [DOI]
  • 27.Houska B. Robust optimization of dynamic systems. 2011. Available: http://dl.sanaye20.ir/RobustOptimazation.pdf. [Google Scholar]
  • 28.Gillis J, Diehl M. A positive definiteness preserving discretization method for lyapunov differential equations. Proc IEEE Conf Decis Control. 2013; 7759–7764. doi: 10.1109/CDC.2013.6761121 [DOI] [Google Scholar]
  • 29.Julier SJ, Uhlmann JK, Durrant-Whyte HF. New approach for filtering nonlinear systems. Proceedings of the American Control Conference. 1995. pp. 1628–1632. doi: 10.1109/acc.1995.529783 [DOI] [Google Scholar]
  • 30.Berret B, Jean F. Stochastic optimal open-loop control as a theory of force and impedance planning via muscle co-contraction. PLOS Comput Biol. 2020;16: e1007414. doi: 10.1371/journal.pcbi.1007414 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Berret B, Conessa A, Schweighofer N, Burdet E. Stochastic optimal feedforward-feedback control determines timing and variability of arm movements with or without vision. PLOS Comput Biol. 2021;17: e1009047. doi: 10.1371/journal.pcbi.1009047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Peterka RJ. Sensorimotor integration in human postural control. J Neurophysiol. 2002;88: 1097–1118. doi: 10.1152/jn.2002.88.3.1097 [DOI] [PubMed] [Google Scholar]
  • 33.Kuo AD. An optimal control model for analyzing human postural balance. IEEE Trans Biomed Eng. 1995;42: 87–101. doi: 10.1109/10.362914 [DOI] [PubMed] [Google Scholar]
  • 34.Groote F De, Blum KP, Horslen BC, Ting LH. Interaction between muscle tone, short-range stiffness and increased sensory feedback gains explains key kinematic features of the pendulum test in spastic cerebral palsy: A simulation study. PLoS One. 2018;13: e0205763. doi: 10.1371/journal.pone.0205763 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Joyce GC, Rack PMH, Westbury DR. The mechanical properties of cat soleus muscle during controlled lengthening and shortening movements. J Physiol. 1969;204: 461–474. doi: 10.1113/jphysiol.1969.sp008924 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Zajac FE. Muscle and tendon: properties, models, scaling, and application to biomechanics and motor control. Critical reviews in biomedical engineering. 1989. pp. 359–411. [PubMed] [Google Scholar]
  • 37.Burdet E, Osu R, Franklin DW, Milner TE, Kawato M. The central nervous system stabilizes unstable dynamics by learning optimal impedance. Nature. 2001;414: 446–449. doi: 10.1038/35106566 [DOI] [PubMed] [Google Scholar]
  • 38.Franklin DW, Liaw G, Milner TE, Osu R, Burdet E, Kawato M. Endpoint stiffness of the arm is directionally tuned to instability in the environment. J Neurosci. 2007;27: 7705–7716. doi: 10.1523/JNEUROSCI.0968-07.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Pedotti A, Krishnan V V, Stark L. Optimization of muscle-force sequencing in human locomotion. Math Biosci. 1978;38: 57–76. doi: 10.1016/0025-5564(78)90018-4 [DOI] [Google Scholar]
  • 40.Kooij H Van Der, Peterka RJ. Non-linear stimulus-response behavior of the human stance control system is predicted by optimization of a system with sensory and motor noise. J Comput Neurosci. 2011;30: 759–778. doi: 10.1007/s10827-010-0291-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Horak FB, Kuo AD. Postural Adaptation for Altered Environments, Tasks, and Intentions. Biomechanics and Neural Control of Posture and Movement. Springer New York; 2000. pp. 267–281. doi: 10.1007/978-1-4612-2104-3_19 [DOI] [Google Scholar]
  • 42.Runge CF, Shupert CL, Horak FB, Zajac FE. Ankle and hip postural strategies defined by joint torques. Gait Posture. 1999;10: 161–170. doi: 10.1016/s0966-6362(99)00032-6 [DOI] [PubMed] [Google Scholar]
  • 43.Groote F De, Allen JL, Ting LH. Contribution of muscle short—range stiffness to initial changes in joint kinetics and kinematics during perturbations to standing balance: A simulation study. J Biomech. 2017;55: 71–77. doi: 10.1016/j.jbiomech.2017.02.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Rack PMH, Westbury DR. The short range stiffness of active mammalian muscle and its effect on mechanical properties. J Physiol. 1974;240: 331–350. doi: 10.1113/jphysiol.1974.sp010613 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.van der Kooij H, Jacobs R, Koopman B, van der Helm F. An adaptive model of sensory integration in a dynamic environment applied to human stance control. Biol Cybern. 2001;84: 103–115. doi: 10.1007/s004220000196 [DOI] [PubMed] [Google Scholar]
  • 46.Fitzpatrick R, McCloskey DI. Proprioceptive, visual and vestibular thresholds for the perception of sway during standing in humans. J Physiol. 1994;478: 173–186. doi: 10.1113/jphysiol.1994.sp020240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Rudolph KS, Axe MJ, Snyder-Mackler L. Dynamic stability after ACL injury: Who can hop? Knee Surgery, Sport Traumatol Arthrosc. 2000;8: 262–269. doi: 10.1007/s001670000130 [DOI] [PubMed] [Google Scholar]
  • 48.Ting LH, Macpherson JM. Ratio of shear to load ground-reaction force may underlie the directional tuning of the automatic postural response to rotation and translation. J Neurophysiol. 2004;92: 808–823. doi: 10.1152/jn.00773.2003 [DOI] [PubMed] [Google Scholar]
  • 49.Assländer L, Peterka RJ. Sensory reweighting dynamics in human postural control. J Neurophysiol. 2014;111. Available: http://jn.physiology.org/content/111/9/1852.full.pdf+html. doi: 10.1152/jn.00669.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sturnieks DL, George RS, Lord SR. Balance disorders in the elderly. Neurophysiol Clin Neurophysiol. 2008;38: 467–478. doi: 10.1016/j.neucli.2008.09.001 [DOI] [PubMed] [Google Scholar]
  • 51.Kandel ER, Schwartz JH, Jessell TM, Siegelbaum S. Principles of neural science. 2000. Available: https://www.academia.edu/download/30536508/neuroscience_syllabus.pdf. doi: 10.1126/science.287.5451.273 [DOI] [PubMed] [Google Scholar]
  • 52.Peterka RJ. Sensory integration for human balance control. Handb Clin Neurol. 2018;159: 27–42. doi: 10.1016/B978-0-444-63916-5.00002-1 [DOI] [PubMed] [Google Scholar]
  • 53.Macpherson JM, Everaert DG, Stapley PJ, Ting LH. Bilateral vestibular loss in cats leads to active destabilization of balance during pitch and roll rotations of the support surface. J Neurophysiol. 2007;97: 4357–4367. doi: 10.1152/jn.01338.2006 [DOI] [PubMed] [Google Scholar]
  • 54.Craig CE, Goble DJ, Doumas M. Proprioceptive acuity predicts muscle co-contraction of the tibialis anterior and gastrocnemius medialis in older adults’ dynamic postural control. Neuroscience. 2016;322: 251–261. doi: 10.1016/j.neuroscience.2016.02.036 [DOI] [PubMed] [Google Scholar]
  • 55.Nashner LM. Adapting reflexes controlling the human posture. Exp Brain Res. 1976;26: 59–72. doi: 10.1007/BF00235249 [DOI] [PubMed] [Google Scholar]
  • 56.Runge CF, Shupert CL, Horak FB, Zajac FE. Role of vestibular information in initiation of rapid postural responses. Exp Brain Res. 1998;122: 403–412. doi: 10.1007/s002210050528 [DOI] [PubMed] [Google Scholar]
  • 57.Iwamoto Y, Takahashi M, Shinkoda K. Differences of muscle co-contraction of the ankle joint between young and elderly adults during dynamic postural control at different speeds. J Physiol Anthropol. 2017;36: 1–9. doi: 10.1186/s40101-017-0149-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Burdet E, Franklin DW, Milner TE. Human robotics: neuromechanics and motor control. 2013. Available: https://mitpress.mit.edu/books/human-robotics. [Google Scholar]
  • 59.Franklin DW, Wolpert DM. Computational mechanisms of sensorimotor control. Neuron. 2011;72: 425–442. doi: 10.1016/j.neuron.2011.10.006 [DOI] [PubMed] [Google Scholar]
  • 60.Franklin DW, Osu R, Burdet E, Kawato M, Milner TE. Adaptation to Stable and Unstable Dynamics Achieved by Combined Impedance Control and Inverse Dynamics Model. J Neurophysiol. 2003;90: 3270–3282. doi: 10.1152/jn.01112.2002 [DOI] [PubMed] [Google Scholar]
  • 61.Tee KP, Franklin DW, Kawato M, Milner TE, Burdet E. Concurrent adaptation of force and impedance in the redundant muscle system. Biol Cybern. 2010;102: 31–44. doi: 10.1007/s00422-009-0348-z [DOI] [PubMed] [Google Scholar]
  • 62.Burdet E, Osu R, Franklin DW, Yoshioka T, Milner TE, Kawato M. A method for measuring endpoint stiffness during multi-joint arm movements. J Biomech. 2000;33: 1705–1709. doi: 10.1016/s0021-9290(00)00142-1 [DOI] [PubMed] [Google Scholar]
  • 63.Crevecoeur F, Sepulchre RJ, Thonnard JL, Lefèvre P. Improving the state estimation for optimal control of stochastic processes subject to multiplicative noise. Automatica. 2011;47: 591–596. doi: 10.1016/j.automatica.2011.01.026 [DOI] [Google Scholar]
  • 64.Afschrift M, van Deursen R, Groote F De, Jonkers I. Increased use of stepping strategy in response to medio-lateral perturbations in the elderly relates to altered reactive tibialis anterior activity. Gait Posture. 2019;68: 575–582. doi: 10.1016/j.gaitpost.2019.01.010 [DOI] [PubMed] [Google Scholar]
  • 65.Koelewijn AD, van den Bogert AJ. A solution method for predictive simulations in a stochastic environment. J Biomech. 2020;104: 109759. doi: 10.1016/j.jbiomech.2020.109759 [DOI] [PubMed] [Google Scholar]
  • 66.Gillis J. Practical methods for approximate robust periodic optimal control of nonlinear mechanical systems. 2015. [Google Scholar]
  • 67.Scott SH. Optimal feedback control and the neural basis of volitional motor control. Nat Rev Neurosci. 2004;5: 532–544. doi: 10.1038/nrn1427 [DOI] [PubMed] [Google Scholar]
  • 68.Loeb GE. Optimal isn’t good enough. Biol Cybern. 2012;106: 757–765. doi: 10.1007/s00422-012-0514-6 [DOI] [PubMed] [Google Scholar]
  • 69.Osu R, Kamimura N, Iwasaki H, Nakano E, Harris CM, Wada Y, et al. Optimal impedance control for task achievement in the presence of signal-dependent noise. J Neurophysiol. 2004;92: 1199–1215. doi: 10.1152/jn.00519.2003 [DOI] [PubMed] [Google Scholar]
  • 70.Pruszynski JA, Kurtzer I, Scott SH. The long-latency reflex is composed of at least two functionally independent processes. J Neurophysiol. 2011;106: 449–459. doi: 10.1152/jn.01052.2010 [DOI] [PubMed] [Google Scholar]
  • 71.Pruszynski JA, Kurtzer I, Lillicrap TP, Scott SH. Temporal evolution of “automatic gain-scaling.” J Neurophysiol. 2009;102: 992–1003. doi: 10.1152/jn.00085.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Pinter IJ, Soest AJ Van, Bobbert MF, Smeets JBJ. Conclusions on motor control depend on the type of model used to represent the periphery. Biol Cybern. 2012;106: 441–451. doi: 10.1007/s00422-012-0505-7 [DOI] [PubMed] [Google Scholar]
  • 73.Lockhart DB, Ting LH. Optimal sensorimotor transformations for balance. Nat Neurosci 2007 1010. 2007;10: 1329–1336. doi: 10.1038/nn1986 [DOI] [PubMed] [Google Scholar]
  • 74.Kalman RE. A new approach to linear filtering and prediction problems. J Fluids Eng Trans ASME. 1960;82: 35–45. doi: 10.1115/1.3662552 [DOI] [Google Scholar]
  • 75.Wächter A, Biegler LT. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math Program. 2006;106: 25–57. doi: 10.1007/s10107-004-0559-y [DOI] [Google Scholar]
  • 76.Millard M, Uchida T, Seth A, Delp SL. Flexing Computational Muscle: Modeling and Simulation of Musculotendon Dynamics. J Biomech Eng. 2013;135. doi: 10.1115/1.4023390 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Andersson JAE, Gillis J, Horn G, Rawlings JB, Diehl M. CasADi: a software framework for nonlinear optimization and optimal control. Math Program Comput. 2019;11: 1–36. doi: 10.1007/s12532-018-0139-4 [DOI] [Google Scholar]
  • 78.Delp SL, Anderson FC, Arnold AS, Loan P, Habib A, John CT, et al. OpenSim: Open source to create and analyze dynamic simulations of movement. IEEE Trans Biomed Eng. 2007;54: 1940–1950. doi: 10.1109/TBME.2007.901024 [DOI] [PubMed] [Google Scholar]
  • 79.Mouel C Le, Brette R. Mobility as the Purpose of Postural Control. Front Comput Neurosci. 2017;11: 67. doi: 10.3389/fncom.2017.00067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Jones KE, Hamilton AF, Wolpert DM. Sources of Signal-Dependent Noise During Isometric Force Production. J Neurophysiol. 2002; 1533–1544. doi: 10.1152/jn.2002.88.3.1533 [DOI] [PubMed] [Google Scholar]
  • 81.Singh NB, König N, Arampatzis A, Taylor WR. Age-related modifications to the magnitude and periodicity of neuromuscular noise. PLoS One. 2013;8: 2–7. doi: 10.1371/journal.pone.0082791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Koelewijn AD, Ijspeert AJ. Exploring the Contribution of Proprioceptive Reflexes to Balance Control in Perturbed Standing. Front Bioeng Biotechnol. 2020;8: 866. doi: 10.3389/fbioe.2020.00866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Sozzi S, Nardone A, Schieppati M. Vision Does Not Necessarily Stabilize the Head in Space During Continuous Postural Perturbations. Front Neurol. 2019;10: 748. doi: 10.3389/fneur.2019.00748 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009338.r001

Decision Letter 0

Wolfgang Einhäuser, Adrian M Haith

3 Sep 2021

Dear Mr Van Wouwe,

Thank you very much for submitting your manuscript "An approximate stochastic optimal control framework to simulate nonlinear neuromusculoskeletal models in the presence of noise" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers were broadly enthusiastic about the approach and its potential to extend the scope of problems for which optimal control policies can be determined and compared with data. Two reviewers raised some important concerns, however, related to the presentation of the work, technical details of the models, and the relationship of the proposed approach to other recently published methods (e.g. Berret and Jean, PLOS Computational Biology, 2020). In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Wolfgang Einhäuser

Deputy Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This article presents an approximate stochastic optimal control framework for investigating feedforward and feedback control in nonlinear neuromusculoskeletal systems with sensorimotor noise. Several simulations are performed for standing and reaching tasks with perturbations and the results are compared to previous experimental data.

Overall, the paper is well-written and develops interesting ideas about the modeling of sensorimotor control. It makes original contributions by applying the framework to standing and using state-of-the-art neuromusculoskeletal models for instance. However, several points need to be addressed to complete the literature, improve the consistency of the modeling work, and clarify its implications for the neural control of movement.

Main comments

First, it must be noted that recent papers developed and applied a very similar approximate stochastic optimal control framework to arm reaching with noisy nonlinear musculoskeletal models (see references 1-3 below). Currently, these papers are not referenced. Due to the similarity with these approaches, the authors should clearly stress and discuss the novelty of their results with respect to these studies as well as revise several statements throughout the paper accordingly.

Second, there are some inconsistencies in the modeling work. The general frameworks of the main text and of the SI text do not correspond exactly, and some discrepancy between the general framework and the specific examples can be noted. For instance, in the standing task, the feedforward control acts at the level of muscle activations (a_base) but both the feedforward and feedback control terms should act at the level of muscle excitations if we refer to the general framework (as done for reaching). Similarly, motor noise should be introduced at the level of muscle excitations if we refer to the general framework described in SI, but both in the standing and reaching tasks it is introduced at the level of muscle force/torque (yet it fits with the framework described in the main text, L. 167). Furthermore, the proposed framework seems to use a reference trajectory (pref). However, pref(t) is not mentioned in the beginning of the Results but appears later in the Methods and in the SI text. These inconsistencies should be fixed to have a general framework that matches the specific examples. This also raises specific questions.

In the standing task, can the authors use some e_ff(t) (feedforward muscle excitations) rather than a_base (feedforward muscle activations) as control? Does it influence the findings about co-contraction? Also, why assuming constant controls in the standing task whereas the framework allows for time-varying controls (L. 621-622)? Could it be possible to introduce motor noise on muscle excitations directly (as suggested in SI) rather than on muscle forces/torques and, again, would it influence the results? Finally, as pref(t) seems to be a deterministic function of the mean trajectory in simulations (L. 750-751), does pref(t) need to be explicitly part of the optimization or could yfb be defined as yfb=h(x,xmean)? If the framework really needs pref(t), its nature and role must be better explained from the beginning.

Third, the way feedback is modeled in the present paper may be somewhat confusing. This could explain the failure to predict a significant muscle co-contraction in the simulation of the divergent force field experiment (discussed L. 467-489). The reason for this failure thus deserves to be clarified because previous simulations assuming feedforward-only control had predicted muscle co-contraction for unstable tasks with a similar framework (1). Here the current lack of co-contraction might be related to an overestimated efficiency of high-level feedback control in this task because state estimation processes and sensory delays are not explicitly modeled. Although the authors explain that the time constant (tau) of the first-order muscle dynamics “lumps together sensory and motor delays”, it is still different from a feedback controller using delayed sensory measurements. Here, yfb(t) is known instantaneously from the current state x(t) and it thus affects muscle excitations (control) instantaneously. The constant tau mainly specifies that muscle excitations cannot modify muscle force instantaneously but does not specify that the controller has no access to the current state (and must infer it). Thus, in a divergent force field, the brain may not know the current hand’s state as accurately as assumed by the present modeling, which could explain why muscle co-contraction is observed in experimental data. Therefore, would the model predict more muscle co-contraction if one assumes more uncertainty about yfb(t) in the divergent force field task or considers explicitly a delayed feedback signal yfb(t-delay)?

By extension, it is unclear whether the modeled feedback indirectly captures the effects of long-latency or short-latency reflexes, or a mix of both. The fact that the feedback signal yfb involves task-level variables (e.g., end-effector position/velocity) points toward a supraspinal processing (thus long-latency reflexes). However, would the results change if feedback signals involved only joint positions/velocities (i.e., q and qdot directly, without the forward kinematics mapping)? If they don’t, together with the absence of explicit modeling of sensory delays and state estimation process, this could rather point toward short-latency spinal reflexes, the gains of which increase with muscle co-contraction as noted by the authors (L. 487-489). If these considerations can affect the interpretation of K(t) and its contribution to impedance control in the proposed model, this should be better tested and stressed throughout the paper.

Other comments

Hand velocity profiles should be depicted to fully visualize trajectories in the reaching tasks. Also, it could be interesting to plot the optimal deterministic controls (in a supplementary file if needed).

Could you please detail the derivation of Eq. 28 from Eq. 1 in the SI? The reason why cross terms disappear was not clear to me. Also, is Eq. 38 in SI based on a Taylor expansion?

Note that the SI file still contains some comments from the authors. I suggest to carefully review all equations within. I did not check all equations there because they are more secondary to the article, but I am not sure that M_k is defined in Eq. 30 and the following.

Throughout the paper, the modeling work involves stochastic differential equations but uses the formalism of ordinary differential equations. Why?

Figures 2 and 3 seem to have been inverted. Please check the captions as well. In Fig 2B, please indicate the experimental conditions for each graph.

L. 527: the dynamics was denoted f(x,e,w) in the Results and becomes f(x,u,w) here. Why not introducing u(t) already from the Results?

L. 537: Why is there a prime in Eq. 2 for sigma_w? If it is the transposition operator, please note that it is also denoted by T elsewhere.

In Eq. 5 (L. 541), the expectation is not needed if all the quantities are deterministic. Please note that J is already used L. 166 to denote the integral cost (not integrand).

Please indicate the time interval in the integrals (e.g. L. 166), and mention if the optimal control problems are considered in finite horizon or if the framework is also considered in infinite horizon (as asymptotic stability is mentioned in the standing task?).

L. 403-405: this statement is not true if we consider the references below (1-3)

L. 490-492: but see previous works (1-3)

L. 501-504: this was already observed in (1)

L. 592-596: I did not understand this paragraph well. Is it to specify the constraints of the optimal control problem? Why using these assumptions that seem to be quite strong? Please clarify.

L. 689: why not considering biarticular muscles since the model and methods allowed it? Then, the authors could have directly reused existing neuromechanical models without having to adjust muscle parameters. I did not get the reason why such a simplification was implemented here.

L. 745: "Tmotor the stochastic torque acting at shoulder and elbow resulting from motor noise ws,we.". This sentence sounds strange as motor noise is added to Tmotor. Please verify.

References

1. https://doi.org/10.1371/journal.pcbi.1007414

2. https://doi.org/10.1371/journal.pcbi.1009047

3. https://doi.org/10.1016/j.automatica.2020.108874

Reviewer #2: 1. Lines 45, 46. Suggest ‘. . . to be identified since task constraints . . . penalty terms in a cost function.’

2. Line 64, Suggest ‘. . . to aid . . .’

3. Lines 105. 106. There are many mentions of ‘collocation’ and ‘shooting methods’ throughout the manuscript beginning on these lines. While the given reference [5] provides a window into learning more about what exactly are these methods, this paper would be more understandable if some definition or description of what is meant by these terms are provided. Perhaps an appropriate place would be in the Supplementary material or possibly the Discussion.

4. Line 127. References [27-[29] are given for Houska. Is this correct? Or should it be [25]-[27]?

5. Line 194. This is the first mention of short-range stiffness. The acronym SRS should be defined here.

6. Line 200. Should be ‘. . . noise was added to . . .’

7. Line 222. Reference [47] is a reference to a very general neuroscience textbook. Please use a more appropriate reference like [31]

8. Line 229. CCI was not previously defined and is only defined in the Figure 1 legend. It should be defined here. Similarly, SRS was not defined previously. Also, the labels ‘healthy-SRS’ and ‘VL-SRS’ used here and in Figure 1 is somewhat confusing since the ‘-‘ could be interpreted as a minus sign in the sense that the model is subtracting out the SRS component rather than including it. Suggest maybe ‘healthy with SRS’ and ‘VL with SRS’ be used here and in Figure 1. Also, this labeling is used in the Methods section lines 609, 610.

9. Lines 243, 244. This is an interesting prediction regarding a shift toward increased use of vestibular information for balance with increasing platform translation amplitude. This is an example of model prediction that could potentially be tested in future experiments to test the validity of the prediction (and the methods used to make the prediction) as well as revealing something new about balance control. You might want to highlight this in the Discussion.

10. Figure 1. For the experimental rotation results showing in part A, there are experiment sensory weight measures (the Kproprio/( Kproprio+Kvest) measure) from four VL subjects as a function of rotation magnitude (Figure 10 in reference [31]). These could be shown and would be in agreement with the model-predicted results.

11. Figures 2 and 3. I note that in the reviewer download of the manuscript pdf Figures 2 and 3 were reversed, but not in the pre-print pdf.

12. Figure 2. In part B the ‘Reaching accuracy’ plots should be centered below their respective reaching trajectories.

13. Figure 3C. The vertical line indicating the perturbation time for the Posterior Deltoid is missing.

14. Line 385. Should be ‘. . . to trade them off . . .’

15. Equation 8. In comparison to equation 10 in the Supplementary Information, it’s not clear why the lambda variable in equation 8 has an ‘i’ subscript. The information given at the end of the following paragraph suggest that lambda would have a fixed value.

16. Line 648. Should be ‘Table 2’.

17. Line 656. No et al. in reference [31].

18. Line 671 Table description. It’s unusual to have such a long table description. Suggest that this description would be better made a part of the text since it also applies to the reaching simulations (or any system that is simulated).

19. Line 580 and 696. Is this ‘first-order dynamics’ with 150ms time constant meant to represent or approximate actual time delays in a closed loop feedback system because it’s not possible to represent a true time delay in their model formulation? If so, it would be informative for the authors to state whether their methods allow for the inclusion of a true time delay and thus necessitating use of some approximation.

20. Lines 731, 732. This is text left over from the rotation section and should be deleted.

21. Reference section. There are numerous references with incomplete information including 2, 3, 18, 25, 27, 28, 54, 56, 63, and 65.

Supplementary Information comments:

1. First line after equation 44. Should be ‘. . . consider the first term in . . .’

2. Section (2) below equation 45. Suggest ‘One could argue that delta must be chosen to be a smaller value, but . . .’

3. Incomplete supplement references include 1, 5, and 9.

Reviewer #3: See attachment.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No: The experimental data discussed in the manuscript are only examples from previous studies by others and not original data. So the presented experimental data is not an issue. The manuscript provides detailed descriptions of the methods and equations that comprise their methods but there is no explicit computer code that implements any of their examples. It would be hard, but not impossible, for another researcher to start from scratch to implement their method so actual example code would be beneficial.

Reviewer #3: No: The manuscript includes a statement that code will be made available via Github. No link was provided yet.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Bastien Berret

Reviewer #2: No

Reviewer #3: Yes: Antonie J. van den Bogert

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Attachment

Submitted filename: PCOMPBIOL-D-21-01394_comments.docx

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009338.r003

Decision Letter 1

Wolfgang Einhäuser, Adrian M Haith

16 Mar 2022

Dear Mr Van Wouwe,

Thank you very much for submitting your manuscript "An approximate stochastic optimal control framework to simulate nonlinear neuromusculoskeletal models in the presence of noise" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication. However, the reviewers pointed out a number of minor issues that should be addressed first.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Wolfgang Einhäuser

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have provided satisfactory answers to most of my concerns. The article now provides a clearer contribution to the modeling of motor control, which will hopefully trigger future developments in this area.

Please note that the paper still contains a number of typos that should be corrected before publication.

For instance, in the main text:

Fitt’s law -> Fitts’ law

‘to perform a point-to-point reaching movement over a distance of 25cm in 0.5s’ -> apparently it was rather 0.8s.

Corriolis -> Coriolis

Please check Eq. 25 (Fiso and activation seem to be inverted)

Eq. 2627 is misnumbered

And in the Supplementary Information:

missing transpose in Eq. 7 and below (for the term in C(t))

please correct the ‘dynamics wherethe controls:: ‘

‘reference trajectory ()’ is still mentioned

Reviewer #2: This reviewer has only a few minor comments. The manuscript pdf did not include line numbers or page numbers. My references to page numbers below begin with the title page being page 1.

Minor comments:

1) Page 4. The references [35] [36] apparently should be [34] [35].

2) Page 4, three lines below the above references, suggest eliminating the first ‘the’ in the sentence so the sentence reads ‘This does not seem to yield an optimal solution as during the optimization . . .’

3) The SRS acronym is still not clearly defined. The place to define it is on page 5 (near the middle of the page) were ‘short-range stiffness’ is first mentioned.

4) Page 7. My previous review suggested that the Kandel reference (now reference [53]) should not be used for the explanations included on page 7 since this is a general textbook where it would be very hard to find what is discussed in this section. The original reference for the data shown in Kandel is the Peterka 2002 reference (now reference [36]). I also suggested that Figure 1A should include the proprioceptive weight measures shown for vestibular loss (VL) subjects. These VL proprioceptive weights are shown in Figure 10 of [36]. The authors apparently did not understand that Figure 10 does show proprioceptive weight measures from four VL subjects obtained in an eyes closed, surface-tilt stimulus condition that is relevant to the discussion in this section (as well as other conditions that are not relevant here). Another reference that more clearly presents that Peterka 2002 weights for the eyes closed, surface tilt condition is in Figure 2.5A of the reference:

Peterka, R.J., Sensory integration for human balance control. Handbook of Clinical Neurology, Vol 159 (3rd series) Balance, Gait and Falls, B.L. Day and S.R. Lord editors, 159:27-42, 2018.

5) Page 9 near the middle of the page, the sentence starting ‘Similar to the experiments, . . ‘. Suggest to use ‘ . . . for the obstacle task compared to the circle task as the second part . . .’

6) Page 17, equation (11). This equation included the variable ‘a’ but the next line refers to ‘a_base’.

7) Page 23 just before the [81] reference. Units should be (Nm)^2/Hz.

8) Page 5 of ‘Supplementary Information’ document. At the bottom of page 5 there are 5 equations shown but the equation numbers include equation (4) for which there is no actual equation shown. Assuming equation (4) should not exist, its elimination would affect subsequent equation labeling.

9) Page 6 of ‘Supplementary Information’ document. Should the gamma variable in equation (10) include a ‘i’ subscript as in equation (8) in the main manuscript?

10) Page 8 of ‘Supplementary Information’ document. Between equations (27) and (28) there is the ‘s.t.’ abbreviation. Better to spell out ‘subject to’.

11) Page 10 of ‘Supplementary Information’ document. Under point (2) of the ‘We observed the following problems.’ section, better to say ‘. . . must be chosen to be smaller, but . . .’

Reviewer #3: I am entirely satisfied with the author's responses and manuscript revisions.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Bastien Berret

Reviewer #2: No

Reviewer #3: Yes: Antonie J. van den Bogert

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009338.r005

Decision Letter 2

Wolfgang Einhäuser, Adrian M Haith

11 Apr 2022

Dear Mr Van Wouwe,

We are pleased to inform you that your manuscript 'An approximate stochastic optimal control framework to simulate nonlinear neuromusculoskeletal models in the presence of noise' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Adrian M Haith

Associate Editor

PLOS Computational Biology

Wolfgang Einhäuser

Deputy Editor

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009338.r006

Acceptance letter

Wolfgang Einhäuser, Adrian M Haith

9 May 2022

PCOMPBIOL-D-21-01394R2

An approximate stochastic optimal control framework to simulate nonlinear neuromusculoskeletal models in the presence of noise

Dear Dr Van Wouwe,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Describes additional results, motivation and consequences on assuming rigid tendons and more details on the implemented method.

    (PDF)

    Attachment

    Submitted filename: PCOMPBIOL-D-21-01394_comments.docx

    Attachment

    Submitted filename: ResponseToReviewers.docx

    Attachment

    Submitted filename: Answers to reviewer.docx

    Data Availability Statement

    No original experimental data was produced. Code is available on GitHub at https://github.com/tomvanwouwe1992/SOC_Paper Code runs in MATLAB R2020a.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES