RLHF: Difference between revisions

Latest revision as of 22:23, 11 May 2024

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback, a training process for machine learning.

@@ Line 9: / Line 9: @@
 * [[ChatGPT]]
 * [[Generative pre-trained transformer]]  (GPT)
+* [[Google Gemini]]
 * [[Information technology]]
 * [[Machine learning]]
 * [[Natural language]]
-* [[Natural language processing]]
+* [[Natural language processing]]  (NLP)
 * [[Reinforcement Learning from Human Feedback]]
+* [[Software]]
-[[Category:The_business_context]]
 [[Category:Identify_and_assess_risks]]
 [[Category:Manage_risks]]
+[[Category:Risk_reporting]]
 [[Category:Risk_frameworks]]
-[[Category:Risk_reporting]]
+[[Category:The_business_context]]
-[[Category:Technology]]