Algorithm 3 Contextual bandit for clinical trial optimization |
-
1:
for do ▹ initialize all context-free bandits
-
2:
initialize context-free bandit()
-
3:
fordo
-
4:
observe context(patient i)
-
5:
▹ bandit associated with context
-
6:
select treatment() ▹ select a treatment based on priors in
-
7:
Apply and observe
-
8:
update prior(, ) ▹ update the prior of in
|