📚 Michele's Notes

❯

Scientific Literature References

❯

literature notes

❯

GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback

GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback

Jul 23, 20261 min read

year : 2023
authors : Jie Huang, Jiangshan Hao, Rongshun Juan, Randy Gomez, Keisuke Nakamura, Guangliang Li
repository :
proceedings : 2023 IEEE International Conference on Robotics and Automation (ICRA)
journal :
volume :
issue :
publisher :
doi : 10.1109/ICRA48891.2023.10160939
Abstract : Generative adversarial imitation learning (GAIL) — a general model-free imitation learning method, allows robots to directly learn policies from expert trajectories in large environments. However, GAIL shares the limitation of other imitation learning methods that they can seldom surpass the performance of demonstrations. In this paper, to address the limit of GAIL, we propose GAN-based interactive reinforcement learning (GAIRL) from demonstrations and human evaluative feedback, by combining the advantages of GAIL and interactive reinforcement learning. We test GAIRL in six physics-based control tasks, ranging from simple low-dimensional control tasks — Cart Pole, Mountain Car and Lunar Lander, to difficult high-dimensional tasks — Inverted Double Pendulum, Hopper and HalfCheetah. Our results suggest that, the GAIRL agent can generally surpass the performance of demonstrations in both low-dimensional and high-dimensional tasks and get an optimal or close to optimal policy.

research

GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback

Huang et al_2023_GAN-Based Interactive Reinforcement Learning from Demonstration and Human.pdf

Notes

Graph View

GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback
Notes

Backlinks

Reinforcement Learning with Human Feedback

Created with Quartz v4.4.0 © 2026

GitHub
Discord Community