Kaelbling LP, Littman ML, Cassandra AR. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence. 101


References and models that cite this paper

Daw ND, Courville AC, Touretzky DS. (2006). Representation and timing in theories of the dopamine system. Neural computation. 18

Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19
