Anderson CW et al. (2001). Robust reinforcement learning control with static and dynamic stability International Journal Of Robust And Nonlinear Control. 11
Atkeson CG, Morimoto J. (2003). Nonparametric representation of policies and value functions: A trajectory-based approach Advances in neural information processing systems. 15
Atkeson CG, Morimoto J. (2003). Minimax differential dynamic programming: An application to robust biped walking Advances in neural information processing systems. 15
Barto AG, Singh SP, Connolly C, Grupen R. (1994). Robust reinforcement learning in motion planning Advances in neural information processing systems. 6
Barto AG, Sutton RS, Anderson CW. (1983). Neuronlike elements that can solve difficult learning control problems IEEE Trans Systems Man Cybern. 13
Coraluppi SP, Marcus SI. (1999). Risk-sensitive and minmax control of discrete-time finite-state Markov decision processes Automatica. 35
Doya K. (2000). Reinforcement learning in continuous time and space Neural computation. 12
Doya K, Morimoto J. (2000). Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning Proceedings of Seventeenth International Conference on Machine Learning.
Doya K, Morimoto J. (2001). Acquisition of stand-up behavior by a real robot using hierarchical reinforcement learning Robotics And Autonomous Systems. 36
Doya K, Morimoto J. (2001). Robust reinforcement learning Advances in neural information processing systems. 13
Ghaoui LE, Nilim A. (2004). Robustness in Markov decision problems with uncertain transition matrices Advances in neural information processing systems. 16
Gullapalli V. (1990). A stochastic reinforcement learning algorithm for learning real-valued functions Neural Networks. 3
Imafuku K. (1999). Singularities of nonlinear control systems designed by Hamilton-Jacobi equations Unpublished doctoral dissertation.
Klopf AH, Harmon ME, Baird LC III. (1995). Advantage updating applied to a differential game Advances in neural information processing systems. 7
Littman ML. (1994). Markov games as a framework for multi-agent reinforcement learning Proceedings of the Eleventh International Conference on Machine Learning.
Littman ML. (2001). Value-function reinforcement learning in Markov games Journal Of Cognitive Systems Research. 2
Neuneier R, Mihatsch O. (1998). Risk sensitive reinforcement learning Advances in neural information processing systems. 11
Tesauro G. (1992). Practical issues in temporal difference learning Mach Learn. 8
Weiland S. (1989). Linear quadratic games, H∞, and the Riccati equation Proceedings of the Workshop on the Riccati Equation in Control, Systems, and Signals.
Yoshioka T, Ishii S. (1998). Strategy acquisition for the game "Othello" based on reinforcement learning International Conference on Neural Information Processing.
Zhou K, Doyle JC, Glover K. (1996). Robust and Optimal Control.