Table 5.
Environment parameters across endophenotypes controlled by the RL model
| Name | Description | Value |
|---|---|---|
| NT | Number of states | 22 |
| NG | Number goal states | 1 |
| ND | Number drug/aftereffect states | 15 |
| Nn | Number neutral states | 6 |
| Na | Number of actions | 9 |
| S0 | Starting state | 4 |
| Rp | Punishment at the end of drug/aftereffect consumption | -4 |
| Rc | Punishment in drug/aftereffect area | -1.2 |
| Rdd | Reward at drug consumption (f2,f4) | 10 |
| Rdt | Reward at drug consumption in treatment | -1 |
| Rg | Reward when entering goal state | 1 |
| dinit | Duration initial (no drug) phase | 50 |
| ddrug1 | Duration first drug phase | 1000 |
| dtpy | Duration treatment phase | 1000 |
| ddrug2 | Duration second drug phase | 600 |