LLM Training - A simple visual guide beginners
LLM Training - A simple visual guide beginners
PRESENTS:
COFFEE BREAK
CONCEPTS
@MASTERING-LLM-
LARGE-LANGUAGE-
MODEL
Step 1 : Pre-training
Step 1 is to train a model on a massive dataset
from the internet to predict the next word -
This is usually called as Language Model
01
@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Cool so i can use this model?
Not Yet
In step 1, the model understands how to
predict the next word but doesn't
understand any instructions.
03
@MASTERING-LLM-LARGE-LANGUAGE-MODEL
I got a model now? Wait not
yet. Lets look into below
senarios
The Instruction models (SFT) are not helpful,
honest ,and harmless (HHH), we need to teach
them this so that they learn to respond with
HHH
SOURCE
04
@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Step 3 : RLHF
We need to teach the model the human
preferences and focus on being helpful,
honest and harmless (HHH)
In this step, model is asked to generate multiple outputs
and humans will rank this output from best to worst.
05
@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Final Model
In final step:
The instruction model is used to
generate an answer
Once the answer is generated, the
reward model (Replacement of humans)
will generate a score.
This score is used to improve the output
until the desired accuracy or number of
iteration is reached.
06
@MASTERING-LLM-LARGE-LANGUAGE-MODEL
Summary
Language model just understands how
to predict next words.
LLM Interview
Course
Want to Prepare yourself for an
LLM Interview?
100+ Questions spanning 14 categories
AgenticRAG with
LlamaIndex
Want to learn why AgenticRAG is
future of RAG?
Master RAG fundamentals through practical
case studies