Daw ND, Courville AC, Touretzky DS. (2006). Representation and timing in theories of the dopamine system. Neural computation. 18 [PubMed]
Fujita H, Ishii S. (2007). Model-based reinforcement learning for partially observable games with sampling-based state estimation. Neural computation. 19 [PubMed]