Skip to main content

Reinforcement Learning from Human Feedback

Draft notes on Reinforcement Learning from Human Feedback.

Draft — not yet written.

Overview

Background

Key Ideas

Further Reading