Sutton RS, McAllester D, Singh S, Mansour Y. (2000). Policy gradient methods for reinforcement learning with function approximation. Advances in neural information processing systems. 12


References and models that cite this paper

Chhabra M, Jacobs RA. (2006). Properties of synergies arising from a theory of optimal motor behavior. Neural computation. 18
