Reinforcement Learning from Human Feedback

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback (RLHF) is a training process for machine learning.

It uses human feedback, in the form of human preferences, to rank or score instances of the behaviour or output of the system being trained, for example ChatGPT.
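
As an illustration of how human rankings can become a training signal, the sketch below turns a best-to-worst ranking into pairwise preferences and scores a pair with a Bradley-Terry style loss, a formulation commonly used for reward models. This page does not say which formulation any particular system uses, so the choice of loss is an assumption, and the function names ranking_to_pairs and preference_loss and the example scores are hypothetical.

```python
import math
from itertools import combinations


def ranking_to_pairs(ranked_outputs):
    """Turn a best-to-worst human ranking into (preferred, rejected) pairs."""
    # combinations() keeps input order, so the first item of each pair
    # is always the one the human ranked higher.
    return list(combinations(ranked_outputs, 2))


def preference_loss(reward_preferred, reward_rejected):
    """Bradley-Terry negative log-likelihood: -log(sigmoid(r_pref - r_rej)).

    Assumed loss for illustration; computed as softplus(-margin) in a
    numerically stable form.
    """
    margin = reward_preferred - reward_rejected
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


if __name__ == "__main__":
    # Hypothetical outputs, ranked best to worst by a human labeller.
    pairs = ranking_to_pairs(["answer A", "answer B", "answer C"])
    print(pairs)

    # Hypothetical reward-model scores: the loss is small when the model
    # agrees with the human ranking and large when it disagrees.
    print(preference_loss(2.0, 0.0))
    print(preference_loss(0.0, 2.0))
```

A reward model trained to minimise this loss over many such pairs learns to score outputs the way the human labellers ranked them; that score can then serve as the reward in the reinforcement learning step.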

The human-supervised RLHF stage supplements an initial period of unsupervised training known as generative pre-training.

Other resource

Improving Language Understanding by Generative Pre-Training, Radford, Narasimhan, Salimans & Sutskever, 2018
