Das T, Gosavi A, Mahadevan S, Marchalleck N. (1997). Self-improving factory simulation using continuous-time average-reward reinforcement learning. Proceedings of the 14th International Conference on Machine Learning.