Uncertainty in Artificial Intelligence
Model Regularization for Stable Sample Rollouts
Erik Talvitie
When an imperfect model is used to generate sample rollouts, its errors tend to compound: a flawed sample is given as input to the model, which causes more errors, and so on. This presents a barrier to applying rollout-based planning algorithms to learned models. To address this issue, a training methodology called "hallucinated replay" is introduced, which adds samples from the model into the training data, thereby training the model to produce sensible predictions when its own samples are given as input. Capabilities and limitations of this approach are studied empirically. In several examples hallucinated replay allows effective planning with imperfect models while models trained using only real experience fail dramatically.
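The idea of mixing the model's own samples into its training data can be illustrated with a minimal sketch. This is not the paper's implementation: the function name `hallucinated_pairs`, the scalar state type, the deterministic stand-in model, and the `depth` parameter are all hypothetical, and pairing each model-generated input with the real next state as its target is an assumed reading of the abstract.

```python
from typing import Callable, List, Sequence, Tuple

State = float                      # assumption: scalar states, for brevity
Model = Callable[[State], State]   # assumption: model maps a state to a predicted next state


def hallucinated_pairs(
    trajectory: Sequence[State], model: Model, depth: int
) -> List[Tuple[State, State]]:
    """Build (input, target) training pairs from one real trajectory.

    Real pairs use a real state as input. Hallucinated pairs use the
    model's own prediction as input, with the real next state as target.
    """
    pairs: List[Tuple[State, State]] = []
    for start in range(len(trajectory) - 1):
        # Real experience: real state -> real next state.
        pairs.append((trajectory[start], trajectory[start + 1]))

        # Roll the model forward from the real state for up to `depth` steps.
        imagined = trajectory[start]
        for k in range(1, depth + 1):
            if start + k + 1 >= len(trajectory):
                break  # no real target left to pair with
            imagined = model(imagined)  # model's own output for time start + k
            # Teach the model what really follows, given its own flawed input.
            pairs.append((imagined, trajectory[start + k + 1]))
    return pairs


# Usage: the true dynamics double the state; the stand-in model is imperfect.
real_trajectory = [1.0, 2.0, 4.0, 8.0]
imperfect_model = lambda s: 1.5 * s
real_only = hallucinated_pairs(real_trajectory, imperfect_model, depth=0)
with_replay = hallucinated_pairs(real_trajectory, imperfect_model, depth=1)
```

With `depth=0` the training set contains only real experience; a larger `depth` adds inputs drawn from the model's own rollout. A stochastic model would sample its next state at the marked line rather than compute it deterministically.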
Pages: 780-789
PDF Link: /papers/14/p780-talvitie.pdf
AUTHOR = "Erik Talvitie",
TITLE = "Model Regularization for Stable Sample Rollouts",
BOOKTITLE = "Proceedings of the Thirtieth Annual Conference on Uncertainty in Artificial Intelligence (UAI-14)",
ADDRESS = "Corvallis, Oregon",
YEAR = "2014",
PAGES = "780--789"
