Reinforcement Learning from Human Feedback Basics

Problem Setup

This chapter collects the definitions, symbols, and operations frequently used in the RLHF process.

ML Definitions

\[ D_{KL}(P \,\|\, Q) = \sum_{x \in \mathcal{X}} P(x) \log \left(\frac{P(x)}{Q(x)}\right) \]
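
The formula above is the Kullback-Leibler (KL) divergence between two discrete distributions P and Q over the same set \(\mathcal{X}\). As a minimal sketch, it can be evaluated directly in Python. The function name `kl_divergence` and the example distributions are hypothetical, chosen only for illustration. The code assumes natural logarithms (result in nats) and the usual convention that terms with P(x) = 0 contribute zero.

```python
import math


def kl_divergence(p, q):
    """Return D_KL(P || Q) in nats for two discrete distributions.

    `p` and `q` are equal-length sequences of probabilities over the same
    outcomes (an assumption of this sketch; inputs are not renormalized).
    """
    if len(p) != len(q):
        raise ValueError("p and q must cover the same outcomes")

    total = 0.0
    for p_x, q_x in zip(p, q):
        if p_x == 0.0:
            # Convention: 0 * log(0 / q) is treated as 0.
            continue
        if q_x == 0.0:
            # P puts mass where Q has none, so the divergence is infinite.
            return math.inf
        total += p_x * math.log(p_x / q_x)
    return total


# Illustrative distributions over two outcomes (made-up values).
p = [0.5, 0.5]
q = [0.9, 0.1]

print(kl_divergence(p, q))  # divergence of Q from P
print(kl_divergence(q, p))  # generally a different value
print(kl_divergence(p, p))  # identical distributions give 0.0
```

The two directions generally give different values, so the KL divergence is not symmetric and is not a distance in the metric sense.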

NLP Definitions

RL Definitions