Abstract
Rapid human arm movements often have velocity pro.les consisting of
several bell-shaped
accelerationdeceleration phases, sometimes overlapping in time and
sometimes appearing separately.
We show how such sub-movement sequences can emerge naturally as an
optimal control
policy is approximated by a reinforcement learning system in the face
of uncertainty and feedback
delay. The system learns to generate sequences ofpulse-step commands,
producing fast
initial sub-movements followed by several slow corrective sub-movements
that often begin before
the initial sub-movement has completed. These results suggest how
the nervous system
might e3ciently control a stochastic motor plant under uncertainty
and feedback delay.
Users
Please
log in to take part in the discussion (add own reviews or comments).