[Preprint]. 2023 Mar 15:arXiv:2302.02477v3. [Version 3]

Figure 3:

Timeline for training RL-based DBS controllers in clinical studies. Because only limited data can be collected during each clinical visit, offline RL can be used to fine-tune existing controllers or train new ones on all of the historical data. Offline policy evaluation (OPE) then helps select the controllers most likely to perform best, which are tested at the next visit.