RLHF

From ACT Wiki
Revision as of 09:52, 20 April 2023 by imported>Doug Williamson (Create page. Sources: Linked pages.)

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback: a machine learning training process in which human evaluations of a model's outputs are used to guide its further training.


See also