Andreas Vlachos

R255 Imitation Learning

Imitation learning was initially proposed in robotics as a way to improve robots (Schaal, 1999). The connecting theme is to combine a reward function, evaluated at the end of the action sequence, with expert demonstrations of the task at hand. Since then it has been applied to a number of tasks that can be modelled as a sequence of actions taken by an agent. These include video game agents, moving cameras to track players, and structured prediction for various tasks in natural language processing. For a recent tutorial see here.

Over the years a number of algorithms have been proposed in the literature, but the connections between the various approaches have not always been made clear. The initial lecture will set out the criteria by which the algorithms will be examined.

The papers presented in the 2023 version of the topic were:

Search-based Structured Prediction Hal Daumé III, John Langford and Daniel Marcu Machine Learning Journal (MLJ), 2009
A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell Artificial Intelligence and Statistics Conference (AISTATS), 2011
Sequence Level Training with Recurrent Neural Networks Marc’Aurelio Ranzato, Sumit Chopra, Michael Auli, Wojciech Zaremba International Conference on Learning Representations (ICLR), 2016
Generative Adversarial Imitation Learning Jonathan Ho and Stefano Ermon Neural Information Processing Systems (NeurIPS), 2016
One-Shot Imitation Learning Yan Duan, Marcin Andrychowicz, Bradly Stadie, Jonathan Ho, Jonas Schneider, Ilya Sutskever, Pieter Abbeel, Wojciech Zaremba Neural Information Processing Systems (NeurIPS), 2017
Disagreement-Regularized Imitation Learning Kianté Brantley, Wen Sun, Mikael Henaff International Conference on Learning Representations (ICLR), 2020
Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning Eugene Valassakis, Georgios Papagiannis, Norman Di Palo, and Edward Johns International Conference on Intelligent Robots and Systems (IROS), 2022