Skip to main content
. 2021 Apr 1;16(4):e0249030. doi: 10.1371/journal.pone.0249030

Fig 9.

Fig 9

Learning Curves: (a) Learning graphs of different agents, b) Reward graph over episodes: Vanilla DQN and DDQN agents, c) Reward graph over episodes: Actor Critic, Rule agnet and Random agent, d) Reward graph over episodes: DDQN agents, (e) Success rate over episodes: DDQN agents, (f) Dialogue length over episodes: DDQN agents.