Skip to main content
. Author manuscript; available in PMC: 2021 Aug 1.
Published in final edited form as: Surgery. 2020 Jun 13;168(2):253–266. doi: 10.1016/j.surg.2020.04.049

Figure 3: Reinforcement learning framework and clinical application.

Figure 3:

An algorithm interacts with its environment (consisting of data from electronic health records or datasets) to learn states (representing disease or patient acuity), actions that lead to new states, probabilities of transitioning between states, and associations between state transitions and an ultimate goal, such as survival or discharge to home in good health. The algorithm then identifies actions that are most likely to achieve the ultimate goal. This process can occur within a Markov Decision Process framework and apply to a patient presenting with bowel obstruction, estimating the clinical utility of observation and operative exploration in response to evolving clinical conditions.