Barto AG, Sutton RS, Anderson CW. (1983). Neuronlike elements that can solve difficult learning control problems IEEE Trans Systems Man Cybern. 13

See more from authors: Barto AG · Sutton RS · Anderson CW

References and models cited by this paper
References and models that cite this paper

Fiete IR, Fee MS, Seung HS. (2007). Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. Journal of neurophysiology. 98 [PubMed]

Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19 [PubMed]

Izhikevich EM. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral cortex (New York, N.Y. : 1991). 17 [PubMed]

Masse NY, Grant GD, Freedman DJ. (2018). Alleviating catastrophic forgetting using context-dependent gating and synaptic stabilization. Proceedings of the National Academy of Sciences of the United States of America. 115 [PubMed]

Morimoto J, Doya K. (2005). Robust reinforcement learning. Neural computation. 17 [PubMed]

Morimoto J, Doya K. (2007). Reinforcement learning state estimator. Neural computation. 19 [PubMed]

Swinehart CD, Abbott LF. (2005). Supervised learning through neuronal response modulation. Neural computation. 17 [PubMed]

Wörgötter F, Porr B. (2005). Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural computation. 17 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.