Skip to main content
. 2014 Jun 11;8:62. doi: 10.3389/fncom.2014.00062

Figure 4.

Figure 4

Average rewards for learning tetherball on the real robot. Mean and standard deviation of three trials. In all of the three trials, after 50 iterations the robot has found solutions to wind the ball around the pole on either side.