Kaelbling LP, Littman ML, Moore AW. (1996). Reinforcement learning: A survey. J Art Intell Res. 4

References and models that cite this paper

Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19

Low KH, Leow WK, Ang MH Jr. (2005). An Ensemble of Cooperative Extended Kohonen Maps for Complex Robot Motion Tasks. Neural computation. 17

Porr B, Wörgötter F. (2006). Strongly improved stability and faster convergence of temporal sequence learning by using input correlations only. Neural computation. 18

Smith AJ, Becker S, Kapur S. (2005). A computational model of the functional role of the ventral-striatal D2 receptor in the expression of previously acquired behaviors. Neural computation. 17

Wörgötter F, Porr B. (2005). Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural computation. 17
