Adversarial Imitation Learning with Preferences

Published in ICLR 2023, 2023

This paper extends Adversarial Imitation Learning to simultaneously utilize both demonstrations and preferences. We leverage the connection between discriminator training and density ratio estimation to incorporate preferences into the Adversarial Imitation Learning paradigm, enabling more flexible policy learning from multiple feedback types.

Recommended citation: Aleksandar Taranovic, Andras Gabor Kupcsik, Niklas Freymuth, Gerhard Neumann. (2023). "Adversarial Imitation Learning with Preferences." International Conference on Learning Representations (ICLR).
Download Paper

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)