Skip to main content
. 2018 Jul 17;5(4):ENEURO.0151-18.2018. doi: 10.1523/ENEURO.0151-18.2018

Table 3.

Optimal policy across endophenotypes controlled by the RL model (2nd drug phase)

State number State type Action Q value
1 Goal ag 2.8967
2 Neutral ag 2.607
3 Neutral as=2 2.3439
4 Neutral as=3 2.1074
5 Neutral as=4 1.8948
6 Neutral as=5 1.7036
7 Neutral as=6 1.5317
8 Drug ad -10.1134
9 Drug-aftereffect ad -10.3781
10 Drug-aftereffect aw -10.4882
11 Drug-aftereffect aw -10.2809
12 Drug-aftereffect aw -9.7099
13 Drug-aftereffect aw -8.6469
14 Drug-aftereffect aw -6.8532
15 Drug-aftereffect aw -3.9265
16 Drug-aftereffect ad -5.2928
17 Drug-aftereffect ad -6.4251
18 Drug-aftereffect ad -7.3633
19 Drug-aftereffect ad -8.1408
20 Drug-aftereffect ad -8.7849
21 Drug-aftereffect ad -9.318
22 Drug-aftereffect ad -9.7575