Data-Efficient Learning from Human Interventions for Mobile Robots

Zhenghao Peng , Zhizheng Liu , Bolei Zhou
University of California, Los Angeles

TL; DR

We train two mobile robots in real world in real time :robot: via human-in-the-loop learning! Our method:

:star2: Learns from online human intervention and demonstration!

:star2: Trains from scratch, without reward!

Safe Navigation

A delivery robot (Unitree Go2) learns to navigate safely in a real-world environment, avoiding collisions with static or dynamic obstacles. Even though the observation is RGB-D image, training can be done in 20 minutes and the robot can generalize to unseen environments.

Human Following

We train a quadruped robot (Unitree Go2) to follow a human subject in a real-world environment. Training is completed within 10 minutes.

Zero-shot Deployment

We have successfully deployed the learned policies to unseen environments.

Safe Navigation:


Human Following:

Full Training Footage