Table 5:
Value estimates for online V-learning simulations with universal and patient-specific policies when γ = 0.9.
| n | T | Universal policy | Patient-specific policy |
|---|---|---|---|
| 25 | 24 | 0.0282 | 0.1813 |
| 36 | 0.1025 | 0.1700 | |
| 48 | 0.0977 | 0.1944 | |
| 50 | 24 | 0.0164 | 0.2771 |
| 36 | 0.0768 | 0.2617 | |
| 48 | 0.0752 | 0.3038 | |
| 100 | 24 | 0.0160 | 0.4230 |
| 36 | 0.0960 | 0.2970 | |
| 48 | 0.1140 | 0.3197 |