RLHF

From ACT Wiki
Revision as of 22:23, 11 May 2024 by Doug (talk | contribs) (Add link.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback, a training process for machine learning.


See also