| RL | Reinforcement learning |
| PPO | Proximal policy optimization |
| D-H | Denavit–Hartenberg |
| RRT | Rapidly-exploring random tree |
| GAE | Generalized advantage estimation |
| RL | Reinforcement learning |
| PPO | Proximal policy optimization |
| D-H | Denavit–Hartenberg |
| RRT | Rapidly-exploring random tree |
| GAE | Generalized advantage estimation |