Table 3.
Optimal policy across endophenotypes controlled by the RL model (2nd drug phase)
| State number | State type | Action | Q value |
|---|---|---|---|
| 1 | Goal | ag | 2.8967 |
| 2 | Neutral | ag | 2.607 |
| 3 | Neutral | as=2 | 2.3439 |
| 4 | Neutral | as=3 | 2.1074 |
| 5 | Neutral | as=4 | 1.8948 |
| 6 | Neutral | as=5 | 1.7036 |
| 7 | Neutral | as=6 | 1.5317 |
| 8 | Drug | ad | -10.1134 |
| 9 | Drug-aftereffect | ad | -10.3781 |
| 10 | Drug-aftereffect | aw | -10.4882 |
| 11 | Drug-aftereffect | aw | -10.2809 |
| 12 | Drug-aftereffect | aw | -9.7099 |
| 13 | Drug-aftereffect | aw | -8.6469 |
| 14 | Drug-aftereffect | aw | -6.8532 |
| 15 | Drug-aftereffect | aw | -3.9265 |
| 16 | Drug-aftereffect | ad | -5.2928 |
| 17 | Drug-aftereffect | ad | -6.4251 |
| 18 | Drug-aftereffect | ad | -7.3633 |
| 19 | Drug-aftereffect | ad | -8.1408 |
| 20 | Drug-aftereffect | ad | -8.7849 |
| 21 | Drug-aftereffect | ad | -9.318 |
| 22 | Drug-aftereffect | ad | -9.7575 |