RLHF: Difference between revisions

From ACT Wiki
imported>Doug Williamson
(Create page. Sources: Linked pages.)
 
* [[Machine learning]]
* [[Natural language]]
* [[Natural language processing]] (NLP)
* [[Reinforcement Learning from Human Feedback]]
* [[Software]]


[[Category:The_business_context]]

Latest revision as of 15:02, 21 April 2023

Information technology - software - natural language processing - artificial intelligence - chatbots - training.

Reinforcement Learning from Human Feedback (RLHF): a training process for machine learning.


See also