Skip to main content
. 2025 Mar 11;28(4):112203. doi: 10.1016/j.isci.2025.112203

Figure 2.

Figure 2

Gait-kinematics for RL agents for all models during walking and running

Shown are the hip, knee, ankle and GRF values averaged over 5 rollouts of 10 s walking and running on flat ground. We excluded rollouts that did not achieve the whole episode length to clearly highlight the achieved kinematics. For walking, we observe slight discrepancies between experimental data (gray) and the RL behaviors (red), which are bigger for high-dimensional models.

(A) The experimental data show human subjects walking at 1.2m/s and is included in SCONE.44

(B) The experimental data show human subjects running at 5m/s and was extracted from Hamner et al.57 For running, the behaviors deviate overall more from experimental data, a weak knee flexion can be observed. Nevertheless, the characteristic single-peaked GRF curve is present for all models, except for the MyoLegV0. The proposed reward function provides a strong and flexible starting point for researchers aiming to create robust and natural controllers for high-dimensional musculoskeletal systems. Also, see the videos on the website. In all simulations, the values (red) are averaged over both legs and 5 rollouts of 10 s are recorded. The experimental data (shaded) is shown as maximum and minimum values over all subjects.