RLHF

From ACT Wiki
imported>Doug Williamson
(Create page. Sources: Linked pages.)

Revision as of 09:52, 20 April 2023

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback.

A training process for machine learning in which human judgements of a model's outputs are used to refine its behaviour.


See also