# robot-rlhf

Robot Learning through Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.
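
To make the idea of a reward function learned from human preferences concrete, here is a minimal sketch of one common formulation, a Bradley-Terry model over pairs of trajectory segments. This README does not state which loss or model the project uses, so treat everything below as an assumption: the linear reward model, the synthetic "human" labels, and all function names (`preference_loss_and_grad`, `train_reward_model`, `make_synthetic_pairs`) are hypothetical and chosen only for illustration.

```python
import numpy as np


def preference_loss_and_grad(w, seg_a, seg_b, label):
    """Bradley-Terry cross-entropy for one labeled pair of segments.

    Assumes a linear per-step reward r(s) = w . s (an illustrative choice).
    seg_a, seg_b: arrays of shape (steps, dim) holding state features.
    label: 1.0 if the human preferred seg_a, 0.0 if they preferred seg_b.
    """
    # Difference of summed features, so logit = return(seg_a) - return(seg_b).
    feat_diff = seg_a.sum(axis=0) - seg_b.sum(axis=0)
    logit = np.clip(feat_diff @ w, -50.0, 50.0)  # clip for numerical safety
    p_a = 1.0 / (1.0 + np.exp(-logit))  # modeled P(seg_a preferred)
    eps = 1e-12
    loss = -(label * np.log(p_a + eps) + (1.0 - label) * np.log(1.0 - p_a + eps))
    grad = (p_a - label) * feat_diff  # gradient of the loss w.r.t. w
    return loss, grad


def train_reward_model(pairs, dim, lr=0.1, epochs=200):
    """Fit reward weights by full-batch gradient descent on the preference loss."""
    w = np.zeros(dim)
    for _ in range(epochs):
        grad = np.zeros(dim)
        for seg_a, seg_b, label in pairs:
            _, g = preference_loss_and_grad(w, seg_a, seg_b, label)
            grad += g
        w -= lr * grad / len(pairs)
    return w


def make_synthetic_pairs(n_pairs=200, seg_len=10, seed=0):
    """Stand-in for human labels: a hidden 'true' reward decides each preference."""
    rng = np.random.default_rng(seed)
    true_w = np.array([1.0, -0.5, 0.0])  # hypothetical ground-truth reward weights
    pairs = []
    for _ in range(n_pairs):
        seg_a = rng.normal(size=(seg_len, true_w.size))
        seg_b = rng.normal(size=(seg_len, true_w.size))
        label = 1.0 if np.sum(seg_a @ true_w) > np.sum(seg_b @ true_w) else 0.0
        pairs.append((seg_a, seg_b, label))
    return pairs, true_w


if __name__ == "__main__":
    pairs, true_w = make_synthetic_pairs()
    w = train_reward_model(pairs, dim=true_w.size)
    cosine = w @ true_w / (np.linalg.norm(w) * np.linalg.norm(true_w))
    print("learned weights:", w)
    print("cosine similarity to hidden reward:", cosine)
```

In a full pipeline, the learned reward would replace the environment reward when training the robot policy with a standard reinforcement learning algorithm. Preferences only identify the reward up to scale, which is why the sketch compares the direction of the learned weights to the hidden ones and not their magnitude.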