Sutton RS, Barto A. (1987). A temporal-difference model of classical conditioning Proceedings of the Ninth Annual Conference of the Cognitive Science Society.

See more from authors: Sutton RS · Barto A

References and models cited by this paper
References and models that cite this paper

Montague PR, Dayan P, Sejnowski TJ. (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of neuroscience : the official journal of the Society for Neuroscience. 16 [PubMed]

Porr B, Wörgötter F. (2006). Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only. Neural computation. 18 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.