Dayan P, Watkins C. (1992). Q-learning Mach Learn. 8

See more from authors: Dayan P · Watkins C

References and models cited by this paper
References and models that cite this paper

Fiore VG, Ognibene D, Adinoff B, Gu X. (2018). A Multilevel Computational Characterization of Endophenotypes in Addiction eNeuro.

Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19 [PubMed]

Porr B, Wörgötter F. (2006). Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only. Neural computation. 18 [PubMed]

Porr B, Wörgötter F. (2007). Learning with "relevance": using a third factor to stabilize Hebbian learning. Neural computation. 19 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.