2022 May 18;9(5):211721. doi: 10.1098/rsos.211721

Figure 2.

Results for the horizontal and sagittal policies. (a–d) show the horizontal targets, and (e–h) show the sagittal targets. (a) Example snapshots for the horizontal targets. (b) Trajectories for the horizontal targets. Black crosses and bullets indicate q_target and q_initial, respectively. (c) Learning progress. Trajectories at the 0, 10, 20, 30, 50, 100, 150, 200, 250 and 300 (×10^3)th iterations are shown. (d) Trajectories for various targets with different θ, including unlearned ones. (e) Example snapshots for the sagittal targets. (f) Trajectories for the sagittal targets. (g) Learning progress. Trajectories at the 0, 10, 20, 30, 50, 100, 150, 200, 250 and 300 (×10^3)th iterations are shown. (h) Trajectories for various targets with different θ, including unlearned ones.