- Gradient Descent Approaches to Neural-Net-Based Solutions of the Hamilton-Jacobi-Bellman Equation
Remi Munos, Leemon Baird, and Andrew Moore
International Joint Conference on Neural Networks, July, 1999. Details |
pdf (192KB) | Copyrighted
- Reinforcement Learning Through Gradient Descent
Leemon Baird
doctoral dissertation, tech. report CMU-CS-99-132, Computer Science Department, Carnegie Mellon University, May, 1999
Details |
pdf (244KB) | Copyrighted
- Gradient Descent for General Reinforcement Learning
Leemon Baird and Andrew Moore
Advances in
Neural Information Processing Systems 11, , 1999 Details |
pdf (48KB) | Copyrighted
- Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs
Andrew Moore, Leemon Baird, and Leslie Pack Kaelbling
Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI '99), 1999. Details |
pdf (414KB) | Copyrighted
|