Skip to main content
. 2023 Jun 27;23(13):5974. doi: 10.3390/s23135974
RL Reinforcement learning
PPO Proximal policy optimization
D-H Denavit–Hartenberg
RRT Rapidly-exploring random tree
GAE Generalized advantage estimation