Singh SP, Jordan MI, Jaakkola T. (1995). On the convergence of stochastic iterative dynamic programming algorithms Neural Comput. 6

See more from authors: Singh SP · Jordan MI · Jaakkola T

References and models cited by this paper
References and models that cite this paper

Wörgötter F, Porr B. (2005). Temporal sequence learning, prediction, and control: a review of different models and their relation to biological mechanisms. Neural computation. 17 [PubMed]

This website requires cookies and limited processing of your personal data in order to function. By continuing to browse or otherwise use this site, you are agreeing to this use. See our Privacy policy and how to cite and terms of use.