RLHF: Difference between revisions
From ACT Wiki
Jump to navigationJump to search
imported>Doug Williamson (Create page. Sources: Linked pages.) |
(No difference)
|
Revision as of 09:52, 20 April 2023
Information technology - software - natural language processing - artificial intelligence - chatbots - training.
Reinforcement Learning from Human Feedback, a training process for machine learning.