Carnegie Mellon Robotics Institute
Geoffrey Gordon
VFA workshop at ML-95, 1995.
| Abstract |
| My paper in the main portion of the conference deals with fitted value iteration or Q-learning for offline problems, i.e., those where we have a model of the environment so that we can examine arbitrary transitions in arbitrary order. The same techniques also allow us to do Q-learning for an online problem, i.e., one where we have no model but must instead perform experiments inside the MDP to gather data. I will describe how. |
| Notes |
Associated Lab(s) / Group(s): Auton Lab
Associated Project(s): Auton Project |
| Text Reference |
| Geoffrey Gordon, "Online Fitted Reinforcement Learning," VFA workshop at ML-95, 1995. |
| BibTeX Reference |
|
@inproceedings{Gordon_1995_2892,
  author = "Geoffrey Gordon",
  title = "Online Fitted Reinforcement Learning",
  booktitle = "VFA workshop at ML-95",
  year = "1995",
} |
| The Robotics Institute is part of the School of Computer Science, Carnegie Mellon University. |