| DRL | Deep reinforcement learning |
| RL | Reinforcement learning |
| PRM | Probabilistic Roadmap |
| TD3 | Twin Delayed Deep Deterministic Policy Gradients |
| SLAM | Simultaneous Localization and Mapping |
| AMCL | Adaptive Monte Carlo Localization |
| A* | A-star search algorithm |
| RRT | Rapidly-exploring Random Trees |
| APF | Artificial Potential Field |
| DWA | Dynamic Window Approach |
| TEB | Timed Elastic Band |
| CNN | Convolutional neural networks |
| A3C | Asynchronous advantage Actor-Critic |
| CARML | Collision avoidance robotics via meta-learning |
| MDP | Markov Decision Process |
| ROS | Robot Operation System |
| URDF | Unified Robot Description Format |
| ODE | Open Dynamics Engine |
| PPO | Proximal Policy Optimization |
| DDPG | Deep Deterministic Policy Gradient |
| SAC | Soft Actor-Critic |