Reinforcement Learning from Human Feedback

Draft notes on Reinforcement Learning from Human Feedback.

draft
certainty: unlikely
importance: 5

Overview
Background
Key Ideas
Further Reading

Draft: not yet written.