Envisioning LogoEnvisioning
HomeServicesWorkSignalsVocabAbout
Vocab
1092 entries
GridGraphSunflowerInfo

Jan Leike

(1 article)
RLHF (Reinforcement Learning from Human Feedback)
2016

RLHF
Reinforcement Learning from Human Feedback

Technique that combines reinforcement learning (RL) with human feedback to guide the learning process towards desired outcomes.

Generality: 625