Sutton RS. (1988). Learning to predict by the method of temporal diferences Machine Learning. 3

See more from authors: Sutton RS

References and models cited by this paper
References and models that cite this paper

Daw ND, Courville AC, Touretzky DS. (2006). Representation and timing in theories of the dopamine system. Neural computation. 18 [PubMed]

Florian RV. (2007). Reinforcement learning through modulation of spike-timing-dependent synaptic plasticity. Neural computation. 19 [PubMed]

Frank MJ. (2005). Dynamic dopamine modulation in the basal ganglia: a neurocomputational account of cognitive deficits in medicated and nonmedicated Parkinsonism. Journal of cognitive neuroscience. 17 [PubMed]

Frank MJ. (2006). Hold your horses: a dynamic computational role for the subthalamic nucleus in decision making. Neural networks : the official journal of the International Neural Network Society. 19 [PubMed]

Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19 [PubMed]

Hasselmo ME. (2005). A model of prefrontal cortical mechanisms for goal-directed behavior. Journal of cognitive neuroscience. 17 [PubMed]

Hazy TE, Frank MJ, O'reilly RC. (2007). Towards an executive without a homunculus: computational models of the prefrontal cortex/basal ganglia system. Philosophical transactions of the Royal Society of London. Series B, Biological sciences. 362 [PubMed]

Izhikevich EM. (2007). Solving the distal reward problem through linkage of STDP and dopamine signaling. Cerebral cortex (New York, N.Y. : 1991). 17 [PubMed]

Montague PR, Dayan P, Sejnowski TJ. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of neuroscience : the official journal of the Society for Neuroscience. 16 [PubMed]

O'Reilly RC, Frank MJ. (2006). Making working memory work: a computational model of learning in the prefrontal cortex and basal ganglia. Neural computation. 18 [PubMed]

O`Reilly RC, Frank MJ. (2005). Making Working Memory Work: A Computational Model of Learning in the Prefrontal Cortex and Basal Ganglia Neural Comput. 18

Porr B, Wörgötter F. (2006). Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only. Neural computation. 18 [PubMed]

Porr B, Wörgötter F. (2007). Learning with "relevance": using a third factor to stabilize Hebbian learning. Neural computation. 19 [PubMed]

Saudargiene A, Porr B, Wörgötter F. (2004). How the shape of pre- and postsynaptic signals can influence STDP: a biophysical model. Neural computation. 16 [PubMed]

Smith AJ, Becker S, Kapur S. (2005). A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors. Neural computation. 17 [PubMed]

Stone JV, Jupp PE. (2007). Free-lunch learning: modeling spontaneous recovery of memory. Neural computation. 19 [PubMed]

Wörgötter F, Porr B. (2005). Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural computation. 17 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.