Singh SP, Loch J. (1998). Using eligibility traces to find the best memoryless policy in partially observable Markov decision processes Proc 15th Intl Conf Mach Learn.
This website requires cookies and limited processing of your personal data in order to function. By continuing to browse
or otherwise use this site, you are agreeing to this use. See our Privacy policy and
how to cite and terms of use.