RLHF

From ACT Wiki
Revision as of 15:02, 21 April 2023 by imported>Doug Williamson (Add link.)

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback: a machine learning training process in which human judgements of a model's outputs are used to guide its further training.


See also