Uncertainty in Artificial Intelligence
First Name   Last Name   Password   Forgot Password   Log in!
    Proceedings   Proceeding details   Article details         Authors         Search    
Active Imitation Learning via Reduction to I.I.D. Active Learning
Kshitij Judah, Alan Fern, Thomas Dietterich
Abstract:
In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at individual states, which are selected based on answers to past queries and the learner's interactions with an environment simulator. We introduce a new approach based on reducing active imitation learning to i.i.d. active learning, which can leverage progress in the i.i.d. setting. Our first contribution, is to analyze reductions for both non-stationary and stationary policies, showing that the label complexity (number of queries) of active imitation learning can be substantially less than passive learning. Our second contribution, is to introduce a practical algorithm inspired by the reductions, which is shown to be highly effective in four test domains compared to a number of alternatives.
Keywords:
Pages: 428-437
PS Link:
PDF Link: /papers/12/p428-judah.pdf
BibTex:
@INPROCEEDINGS{Judah12,
AUTHOR = "Kshitij Judah and Alan Fern and Thomas Dietterich",
TITLE = "Active Imitation Learning via Reduction to I.I.D. Active Learning",
BOOKTITLE = "Proceedings of the Twenty-Eighth Conference Annual Conference on Uncertainty in Artificial Intelligence (UAI-12)",
PUBLISHER = "AUAI Press",
ADDRESS = "Corvallis, Oregon",
YEAR = "2012",
PAGES = "428--437"
}


hosted by DSL   •   site info   •   help