RLHF

From ACT Wiki
imported>Doug Williamson
(Create page. Sources: Linked pages.)
 
(Add link.)
 
* [[ChatGPT]]
* [[Generative pre-trained transformer]] (GPT)
* [[Google Gemini]]
* [[Information technology]]
* [[Machine learning]]
* [[Natural language]]
* [[Natural language processing]] (NLP)
* [[Reinforcement Learning from Human Feedback]]
* [[Software]]


[[Category:Identify_and_assess_risks]]
[[Category:Manage_risks]]
[[Category:Risk_frameworks]]
[[Category:Risk_reporting]]
[[Category:The_business_context]]
[[Category:Technology]]

Latest revision as of 22:23, 11 May 2024

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback (RLHF) is a training process for machine learning in which human judgements of a model's outputs are used to guide its further training.


See also