2025 Sep 1;10(9):577. doi: 10.3390/biomimetics10090577
DRL Deep Reinforcement Learning
HRL Hierarchical Reinforcement Learning
PPO Proximal Policy Optimization
HAC Hierarchical Actor-Critic
A3C Asynchronous Advantage Actor-Critic
DT Decision Transformer
SMDP Semi-Markov Decision Process
DT-HRL Decision Transformer-based Hierarchical Reinforcement Learning
NLP Natural Language Processing
RTG Return-To-Go
PEL Path-Efficiency Loss
MTGC-DT Multi-Task Goal-Conditioned Decision Transformer