RLHF: Difference between revisions
From ACT Wiki
Jump to navigationJump to search
imported>Doug Williamson (Add link.) |
(Add link.) |
||
Line 9: | Line 9: | ||
* [[ChatGPT]] | * [[ChatGPT]] | ||
* [[Generative pre-trained transformer]] (GPT) | * [[Generative pre-trained transformer]] (GPT) | ||
* [[Google Gemini]] | |||
* [[Information technology]] | * [[Information technology]] | ||
* [[Machine learning]] | * [[Machine learning]] | ||
Line 16: | Line 17: | ||
* [[Software]] | * [[Software]] | ||
[[Category:Identify_and_assess_risks]] | [[Category:Identify_and_assess_risks]] | ||
[[Category:Manage_risks]] | [[Category:Manage_risks]] | ||
[[Category:Risk_reporting]] | |||
[[Category:Risk_frameworks]] | [[Category:Risk_frameworks]] | ||
[[Category: | [[Category:The_business_context]] | ||
Latest revision as of 22:23, 11 May 2024
Information technology - software - natural language processing - artificial intelligence - chatbots - training.
Reinforcement Learning from Human Feedback, a training process for machine learning.