Data lake and Generative pre-trained transformer: Difference between pages

From ACT Wiki
(Difference between pages)
Jump to navigationJump to search
(Create page - source - Oracle - https://www.oracle.com/uk/big-data/data-lake/what-is-data-lake/)
 
(Add link.)
 
Line 1: Line 1:
''Information - information technology - data.''
''Information technology - software - natural language processing - artificial intelligence - chatbots.''


A data lake is a repository for a large amount of data, either entirely unstructured, or including both structured and unstructured data.
(GPT).
 
Generative pre-trained transformers are language models that have been pre-trained on large datasets of unlabelled natural language text.
 
They can generate new text that is human-like, and in some cases may be difficult to distinguish from human-written text.
 
 
GPT's unsupervised pre-training may then often be supplemented by additional fine-tuning human supervised training, known as Reinforcement Learning from Human Feedback (RLHF).




== See also ==
== See also ==
* [[Big data]]
* [[Artificial intelligence]] (AI)
* [[Data]]
* [[Bot]]
* [[Data breach]]
* [[Chatbot]]
* [[Data cleansing]]
* [[ChatGPT]]
* [[Data exchange]]
* [[Enterprise-wide resource planning system]]
* [[Data matching]]
* [[Generative AI]] (GenAI)
* [[Data mining]]
* [[GPT-4]]
* [[Data presentation]]
* [[Information technology]]
* [[Data protection ]]
* [[Large language model]] (LLM)
* [[Data Protection Act]]
* [[Natural language]]
* [[Data Protection Authority]]
* [[Natural language processing]]
* [[Data Protection Officer]]
* [[Reinforcement Learning from Human Feedback]] (RLHF)
* [[Data science]]
* [[Robotics]]
* [[Data scraping]]
*[[Software]]
* [[Database]]
* [[Software robot]]
* [[Datasphere]]
 
* [[Electronic data interchange]]  (EDI)
 
* [[General Data Protection Regulation]]  (GDPR)
==Other resource==
* [[Governance]]
*[https://cdn.openai.com/research-covers/language-unsupervised/language_understanding_paper.pdf Improving Language Understanding by Generative Pre-Training, Radford, Narasimhan, Salimans & Sutskever, 2018]
* [[Knowledge discovery in data]]


[[Category:Identify_and_assess_risks]]
[[Category:Manage_risks]]
[[Category:Risk_reporting]]
[[Category:Risk_frameworks]]
[[Category:The_business_context]]
[[Category:The_business_context]]
[[Category:Treasury_operations]]
[[Category:Treasury_operations_infrastructure]]

Latest revision as of 21:26, 4 October 2023

Information technology - software - natural language processing - artificial intelligence - chatbots.

(GPT).

Generative pre-trained transformers are language models that have been pre-trained on large datasets of unlabelled natural language text.

They can generate new text that is human-like, and in some cases may be difficult to distinguish from human-written text.


GPT's unsupervised pre-training may then often be supplemented by additional fine-tuning human supervised training, known as Reinforcement Learning from Human Feedback (RLHF).


See also


Other resource