LLM Training - A simple visual guide for beginners

Uploaded by

vishu sablok

MASTERING LLM PRESENTS:
COFFEE BREAK CONCEPTS

How are LLMs trained?

A simple guide to understanding LLM training

@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Step 1: Pre-training
Step 1 is to train a model on a massive dataset from the internet to predict the next word.
The resulting model is usually called a language model.
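
To make next-word prediction concrete, here is a minimal sketch that uses simple word-pair counts instead of a neural network. The toy corpus and the function names (`train_bigram_model`, `predict_next`) are assumptions made for illustration. A real LLM learns the same kind of "what comes next" statistics, but with a neural network trained on vastly more text.

```python
from collections import Counter, defaultdict


def train_bigram_model(corpus):
    """Count, for every word, which words follow it in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.lower().split()
        for current, following in zip(words, words[1:]):
            counts[current][following] += 1
    return counts


def predict_next(counts, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]


# Hypothetical toy corpus; pre-training uses a massive internet-scale dataset.
toy_corpus = [
    "the cat sat on the mat",
    "the cat ate the fish",
    "the dog sat on the rug",
]

model = train_bigram_model(toy_corpus)
print(predict_next(model, "sat"))
print(predict_next(model, "the"))
```

Note that this model can only continue text. Nothing in it knows what an instruction is.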

Cool, so can I use this model?
Not yet.
After step 1, the model knows how to predict the next word but doesn't understand instructions.

The model just completes the text with the next words.

Step 2: Supervised fine-tuning (SFT) or instruction tuning
Now we need to teach the model to understand specific instructions.
Step 2 helps the model learn to follow instructions.
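
As a rough sketch of what SFT training data can look like, the snippet below turns one instruction/response pair into a token sequence plus a loss mask. The prompt template, the whitespace "tokenizer", and the name `build_sft_example` are all assumptions made for illustration. Computing the loss only on the response tokens is a common choice, not the only one.

```python
def build_sft_example(instruction, response):
    """Turn one (instruction, response) pair into tokens and a loss mask.

    The template below is hypothetical; real projects each define their own.
    Whitespace splitting stands in for a real subword tokenizer.
    """
    prompt = f"### Instruction:\n{instruction}\n### Response:\n"
    prompt_tokens = prompt.split()
    response_tokens = response.split()

    tokens = prompt_tokens + response_tokens
    # 0 = ignore this position in the loss, 1 = train on this position.
    loss_mask = [0] * len(prompt_tokens) + [1] * len(response_tokens)
    return tokens, loss_mask


tokens, loss_mask = build_sft_example(
    "Translate 'hello' to French.",
    "Bonjour",
)
for token, flag in zip(tokens, loss_mask):
    print(flag, token)
```

The training objective is still next-word prediction, as in step 1. What changes is the data: the model now sees many examples of an instruction followed by a good response.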

So I have a usable model now? Wait, not yet. Let's look at the scenarios below.
Instruction-tuned (SFT) models are not yet reliably helpful, honest, and harmless (HHH). We need to teach them these qualities so that they learn to respond in an HHH way.

Step 3: RLHF
We need to teach the model human preferences, with a focus on being helpful, honest, and harmless (HHH).
In this step, the model is asked to generate multiple outputs, and humans rank these outputs from best to worst.

The simple goal of RLHF is to replace direct human feedback with a reward model: a model trained on these rankings so that it understands human preferences.
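
The sketch below shows one common way a human ranking can become a training signal for a reward model. The ranking is expanded into (preferred, rejected) pairs, and a pairwise loss is small when the preferred output gets the higher score. The function names and example outputs are assumptions made for illustration, and the loss shown is the standard pairwise logistic (Bradley-Terry style) form rather than anything specific to this guide.

```python
import math
from itertools import combinations


def ranking_to_pairs(ranked_outputs):
    """Expand a best-to-worst ranking into (preferred, rejected) pairs."""
    # combinations() keeps input order, so the first item of each pair
    # is always the one the human ranked higher.
    return list(combinations(ranked_outputs, 2))


def pairwise_loss(score_preferred, score_rejected):
    """-log(sigmoid(score_preferred - score_rejected)).

    Small when the reward model scores the preferred output higher,
    large when it scores the rejected output higher.
    """
    diff = score_preferred - score_rejected
    return math.log1p(math.exp(-diff))


# Hypothetical outputs for one prompt, ranked best to worst by a human.
ranked = ["clear, correct answer", "vague answer", "rude answer"]
for preferred, rejected in ranking_to_pairs(ranked):
    print(f"{preferred!r} > {rejected!r}")

print(pairwise_loss(2.0, 0.0))  # reward model agrees with the human
print(pairwise_loss(0.0, 2.0))  # reward model disagrees
```

Training the reward model means adjusting it to lower this loss across many ranked examples, after which it can score new outputs without a human in the loop.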

Final Model
In the final step:
The instruction model is used to generate an answer.
Once the answer is generated, the reward model (the replacement for humans) generates a score.
This score is used to improve the model's output until the desired accuracy or number of iterations is reached.
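
The loop above can be sketched with a deliberately tiny "policy" that only chooses among three fixed answers. Everything here is an assumption made for illustration: the candidate answers, the hard-coded `toy_reward_model`, the target value, and the use of a plain policy-gradient update. Production RLHF systems typically use more elaborate algorithms (PPO is a common choice) and also keep the model close to the SFT model.

```python
import math


def softmax(logits):
    """Convert raw preferences (logits) into probabilities."""
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def expected_reward(probs, rewards):
    """Average score the policy would get under its current probabilities."""
    return sum(p * r for p, r in zip(probs, rewards))


def toy_reward_model(answer):
    """Hypothetical stand-in for a learned reward model: a fixed lookup."""
    scores = {
        "I don't know.": 0.1,
        "Paris.": 0.6,
        "The capital of France is Paris.": 1.0,
    }
    return scores[answer]


def improve_policy(candidates, reward_model, target=0.9, max_iterations=1000, lr=1.0):
    """Raise the probability of high-scoring answers until the target
    expected reward or the iteration limit is reached."""
    logits = [0.0] * len(candidates)  # start with no preference
    rewards = [reward_model(c) for c in candidates]
    for _ in range(max_iterations):
        probs = softmax(logits)
        average = expected_reward(probs, rewards)
        if average >= target:
            break
        # Exact policy gradient for a softmax policy:
        # answers scoring above average are pushed up, the rest down.
        logits = [
            logit + lr * p * (r - average)
            for logit, p, r in zip(logits, probs, rewards)
        ]
    probs = softmax(logits)
    return probs, expected_reward(probs, rewards)


candidates = ["I don't know.", "Paris.", "The capital of France is Paris."]
final_probs, final_reward = improve_policy(candidates, toy_reward_model)
for answer, prob in zip(candidates, final_probs):
    print(f"{prob:.2f}  {answer}")
```

The stopping rule mirrors the text: training ends when the score is good enough or the iteration budget runs out.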

Summary
A language model just understands how to predict the next words.

SFT or instruction tuning teaches the model how to follow instructions across multiple different tasks.

RLHF further improves answers according to human preferences such as being helpful, honest, and harmless (HHH).
Check this paper to learn more about LLM alignment.

Newer alignment methods include DPO, which we will cover soon.

Comment below on which topic you want to understand next in this "Coffee Break Concepts" series, and we will include those topics in the upcoming weeks.
www.masteringllm.com

