Dosimetric comparison between RL agent plans and clinical plans. The boxes represent quartiles, and the whiskers mark the datapoints within the 1.5 IQR from the median values. The clinical constraints for bowel D1cc, duodenum D1cc, stomach D1cc are 33 Gy. Cord Dmax is limited below 20 Gy, and kidney V12Gy is limited below 25–50%. All clinical plans and RL agent plans meet these clinical constraints. Reprinted from An interpretable planning bot for pancreas stereotactic body radiation therapy. Int J Radiat Oncol Biol Phys, 109(4):1076-1085, with permission from Elsevier. RL, reinforcement learning; IQR, interquartile ranges.