Skip to main content
. Author manuscript; available in PMC: 2015 Jan 12.
Published in final edited form as: Neural Comput. 2013 Sep 18;25(12):3263–3293. doi: 10.1162/NECO_a_00521

Figure 1.

Figure 1

Overview of model. A virtual arm with joint angles θsh and θel (θsh: angle of upper arm with respect to x-axis; θel: angle of forearm with respect to upper arm) controlled by two pairs of flexor and extensor muscles, is trained to reach toward a target. A proprioceptive (P) sensory area translates muscle lengths into an arm configuration representation. Plasticity is present in excitatory-to-excitatory recurrent connections within the higher-order sensory (S) and the motor (M) areas, in feedforward and feedback excitatory to excitatory connections between the higher-order sensory and the motor areas, and in feedforward connections from excitatory to inhibitory cells within each area. Motor units drive the muscles to change the joint angle. The actor is trained by the critic, which evaluates error and provides a global reward or punishment signal.