
Machine Learning Algorithms Cheat Sheet

Table of Contents

1. Supervised Learning
Linear Regression
Logistic Regression
Decision Trees
Random Forests
Support Vector Machines (SVM)
k-Nearest Neighbors (k-NN)
Naive Bayes
Gradient Boosting Machines (GBM)
Neural Networks
2. Unsupervised Learning
k-Means Clustering
Hierarchical Clustering
Principal Component Analysis (PCA)
Independent Component Analysis (ICA)
Association Rules
Autoencoders
3. Reinforcement Learning
Q-Learning
Deep Q-Networks (DQN)
Policy Gradients
Actor-Critic Methods
4. Semi-Supervised and Self-Supervised Learning
Self-Training
Co-Training
5. Ensemble Methods
Bagging
Boosting
Stacking

Supervised Learning

1. Linear Regression

Purpose: Predict continuous target variables.
Key Concept: Models the relationship between input features and output as a linear combination.
Equation: ( y = \beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \beta_nx_n + \epsilon )
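
A minimal scikit-learn sketch; the toy data and the rough relationship y = 2x + 1 are assumptions made purely for illustration:

    import numpy as np
    from sklearn.linear_model import LinearRegression

    X = np.array([[1.0], [2.0], [3.0], [4.0]])   # single input feature
    y = np.array([3.1, 4.9, 7.2, 8.8])           # roughly 2x + 1 with noise
    model = LinearRegression().fit(X, y)
    print(model.intercept_, model.coef_)         # estimated beta_0, beta_1
    print(model.predict([[5.0]]))                # prediction for a new input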

2. Logistic Regression

Purpose: Binary classification.
Key Concept: Uses the logistic function to model the probability of a class.
Equation: ( P(Y=1) = \frac{1}{1 + e^{-(\beta_0 + \beta_1x_1 + \dots + \beta_nx_n)}} )
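
For illustration, a short scikit-learn example on a synthetic binary problem (the dataset and default settings are placeholder choices):

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression

    X, y = make_classification(n_samples=200, n_features=4, random_state=0)
    clf = LogisticRegression().fit(X, y)
    print(clf.predict_proba(X[:3]))   # P(Y=0) and P(Y=1) for the first three rows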

3. Decision Trees

Purpose: Classification and regression.
Key Concept: Splits data into subsets based on feature values.
Advantages: Easy to interpret, handles both numerical and categorical data.
Disadvantages: Prone to overfitting.
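
A brief scikit-learn sketch; limiting max_depth is shown as one common (assumed, not prescribed here) way to curb the overfitting noted above:

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier

    X, y = load_iris(return_X_y=True)
    tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
    print(tree.predict(X[:5]))   # class predictions for the first five samples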

4. Random Forests

Purpose: Classification and regression.


Key Concept: Ensemble of decision trees using bagging.
Advantages: Reduces overfitting, handles large datasets well.
Key Parameters: Number of trees, max depth.
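
A minimal scikit-learn sketch using the two key parameters listed above (the particular values are illustrative):

    from sklearn.datasets import load_iris
    from sklearn.ensemble import RandomForestClassifier

    X, y = load_iris(return_X_y=True)
    forest = RandomForestClassifier(n_estimators=100, max_depth=5, random_state=0)
    forest.fit(X, y)
    print(forest.feature_importances_)   # importance averaged across the trees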

5. Support Vector Machines (SVM)

Purpose: Classification and regression.
Key Concept: Finds the hyperplane that best separates classes with maximum margin.
Kernel Trick: Enables handling non-linear relationships.
Common Kernels: Linear, Polynomial, RBF.
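
A short scikit-learn sketch with the RBF kernel (synthetic data and C=1.0 are arbitrary illustrative choices):

    from sklearn.datasets import make_classification
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=200, n_features=4, random_state=0)
    # kernel='rbf' applies the kernel trick; C trades margin width against errors
    clf = SVC(kernel='rbf', C=1.0).fit(X, y)
    print(clf.score(X, y))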

6. k-Nearest Neighbors (k-NN)

Purpose: Classification and regression.
Key Concept: Assigns the output based on the majority label of the k closest training examples.
Advantages: Simple, no training phase.
Disadvantages: Computationally intensive during prediction, sensitive to irrelevant features.
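
A minimal scikit-learn sketch; k=5 is just an example value:

    from sklearn.datasets import load_iris
    from sklearn.neighbors import KNeighborsClassifier

    X, y = load_iris(return_X_y=True)
    # Prediction finds the 5 nearest training points and takes a majority vote
    knn = KNeighborsClassifier(n_neighbors=5).fit(X, y)
    print(knn.predict(X[:3]))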

7. Naive Bayes

Purpose: Classification.
Key Concept: Based on Bayes' Theorem with the assumption of feature independence.
Variants: Gaussian, Multinomial, Bernoulli.
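
For illustration, the Gaussian variant in scikit-learn (dataset chosen arbitrarily):

    from sklearn.datasets import load_iris
    from sklearn.naive_bayes import GaussianNB

    X, y = load_iris(return_X_y=True)
    # Assumes each feature is normally distributed within each class
    nb = GaussianNB().fit(X, y)
    print(nb.predict_proba(X[:2]))   # class probabilities via Bayes' Theorem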

8. Gradient Boosting Machines (GBM)

Purpose: Classification and regression.
Key Concept: Builds models sequentially, each new model correcting errors of the previous ones.
Popular Implementations: XGBoost, LightGBM, CatBoost.
Advantages: High predictive performance, handles missing data.
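
A minimal sketch using scikit-learn's built-in implementation; XGBoost, LightGBM, and CatBoost expose similar fit/predict interfaces (parameter values here are illustrative):

    from sklearn.datasets import make_classification
    from sklearn.ensemble import GradientBoostingClassifier

    X, y = make_classification(n_samples=300, random_state=0)
    # Each new shallow tree fits the errors of the ensemble built so far
    gbm = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, max_depth=3)
    gbm.fit(X, y)
    print(gbm.score(X, y))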

9. Neural Networks

Purpose: A wide range of tasks, including classification and regression.
Key Concept: Composed of layers of interconnected nodes (neurons) that can capture complex patterns.
Types: Feedforward Neural Networks, Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN).
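
A small feedforward network via scikit-learn's MLPClassifier, kept as a sketch (layer sizes and iteration count are arbitrary; CNNs and RNNs would typically use TensorFlow/Keras or PyTorch from the Resources section):

    from sklearn.datasets import load_digits
    from sklearn.neural_network import MLPClassifier

    X, y = load_digits(return_X_y=True)
    # Two hidden layers of 64 and 32 neurons with ReLU activations
    mlp = MLPClassifier(hidden_layer_sizes=(64, 32), max_iter=500, random_state=0)
    mlp.fit(X, y)
    print(mlp.score(X, y))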

Unsupervised Learning

1. k-Means Clustering

Purpose: Partition data into k distinct clusters.
Key Concept: Minimizes within-cluster variance.
Parameters: Number of clusters (k), distance metric.
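
A minimal scikit-learn sketch on synthetic blob data (k=3 matches the generated blobs and is otherwise an assumption):

    from sklearn.cluster import KMeans
    from sklearn.datasets import make_blobs

    X, _ = make_blobs(n_samples=300, centers=3, random_state=0)
    km = KMeans(n_clusters=3, n_init=10, random_state=0).fit(X)
    print(km.cluster_centers_)   # one centroid per cluster
    print(km.labels_[:10])       # cluster assignment for the first ten points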

2. Hierarchical Clustering

Purpose: Create a hierarchy of clusters.
Key Concept: Either agglomerative (bottom-up) or divisive (top-down).
Linkage Criteria: Single, complete, average, Ward.
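
A brief agglomerative example in scikit-learn using Ward linkage (data and cluster count are illustrative):

    from sklearn.cluster import AgglomerativeClustering
    from sklearn.datasets import make_blobs

    X, _ = make_blobs(n_samples=100, centers=3, random_state=0)
    agg = AgglomerativeClustering(n_clusters=3, linkage='ward').fit(X)
    print(agg.labels_[:10])   # bottom-up merges until 3 clusters remain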

3. Principal Component Analysis (PCA)

Purpose: Dimensionality reduction.
Key Concept: Transforms data to a new coordinate system with orthogonal principal components.
Uses: Feature reduction, visualization.
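
A minimal scikit-learn sketch projecting 4-dimensional data onto 2 components (the choice of 2 is for visualization only):

    from sklearn.datasets import load_iris
    from sklearn.decomposition import PCA

    X, _ = load_iris(return_X_y=True)
    pca = PCA(n_components=2)
    X_reduced = pca.fit_transform(X)
    print(pca.explained_variance_ratio_)   # variance captured by each component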

4. Independent Component Analysis (ICA)

Purpose: Separate a multivariate signal into additive, independent components.
Key Concept: Maximizes the statistical independence of the estimated components.
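
A sketch with scikit-learn's FastICA on a toy mixture of two hand-made sources (the signals and mixing matrix are assumptions for illustration):

    import numpy as np
    from sklearn.decomposition import FastICA

    t = np.linspace(0, 8, 1000)
    sources = np.c_[np.sin(2 * t), np.sign(np.cos(3 * t))]   # two independent signals
    mixed = sources @ np.array([[1.0, 0.5], [0.5, 1.0]])     # observed mixtures
    ica = FastICA(n_components=2, random_state=0)
    recovered = ica.fit_transform(mixed)   # estimates of the original sources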

5. Association Rules

Purpose: Discover interesting relations between variables in large databases.
Key Concepts: Support, Confidence, Lift.
Algorithms: Apriori, Eclat.
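
Libraries such as mlxtend implement Apriori; the hand computation below just illustrates the three metrics for a hypothetical rule {bread} -> {butter} on made-up transactions:

    transactions = [
        {'bread', 'butter'}, {'bread', 'milk'}, {'bread', 'butter', 'milk'},
        {'milk'}, {'bread', 'butter'},
    ]
    n = len(transactions)
    support_ab = sum('bread' in t and 'butter' in t for t in transactions) / n
    support_a = sum('bread' in t for t in transactions) / n
    support_b = sum('butter' in t for t in transactions) / n
    confidence = support_ab / support_a   # P(butter | bread)
    lift = confidence / support_b         # > 1 means a positive association
    print(support_ab, confidence, lift)   # 0.6 0.75 1.25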

6. Autoencoders

Purpose: Learn efficient codings of input data.
Key Concept: Neural network architecture with encoder and decoder parts.
Uses: Dimensionality reduction, anomaly detection.
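
A compact Keras sketch (assumes TensorFlow is installed; the 20-to-3 bottleneck and the random training data are purely illustrative):

    import numpy as np
    from tensorflow import keras

    inputs = keras.Input(shape=(20,))
    code = keras.layers.Dense(3, activation='relu')(inputs)       # encoder
    outputs = keras.layers.Dense(20, activation='linear')(code)   # decoder
    autoencoder = keras.Model(inputs, outputs)
    autoencoder.compile(optimizer='adam', loss='mse')
    X = np.random.rand(256, 20).astype('float32')
    autoencoder.fit(X, X, epochs=5, verbose=0)   # trained to reconstruct its input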

Reinforcement Learning

1. Q-Learning

Purpose: Learn the value of actions in states to derive an optimal policy.
Key Concept: Off-policy temporal difference learning.
Equation: ( Q(s, a) \leftarrow Q(s, a) + \alpha [r + \gamma \max_{a'} Q(s', a') - Q(s, a)] )
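
A tabular sketch of the update rule above (state/action counts, learning rate, and discount factor are arbitrary example values):

    import numpy as np

    n_states, n_actions = 5, 2
    Q = np.zeros((n_states, n_actions))
    alpha, gamma = 0.1, 0.9   # learning rate and discount factor

    def q_update(s, a, r, s_next):
        # Temporal-difference update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
        Q[s, a] += alpha * (r + gamma * Q[s_next].max() - Q[s, a])

    q_update(s=0, a=1, r=1.0, s_next=2)
    print(Q[0])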

2. Deep Q-Networks (DQN)

Purpose: Combine Q-Learning with deep neural networks.
Key Concept: Uses neural networks to approximate Q-values.
Features: Experience replay, target networks.

3. Policy Gradients

Purpose: Optimize the policy directly.
Key Concept: Uses gradient ascent on expected rewards.
Algorithms: REINFORCE, Proximal Policy Optimization (PPO).

4. Actor-Critic Methods

Purpose: Combine value-based and policy-based methods.
Key Concept: Actor updates policy, Critic evaluates it.
Examples: A3C, DDPG.

Semi-Supervised and Self-Supervised Learning

1. Self-Training

Purpose: Utilize unlabeled data to improve model performance.
Key Concept: Iteratively pseudo-label unlabeled data with the current model's most confident predictions and retrain on the expanded labeled set.
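
A minimal sketch with scikit-learn's SelfTrainingClassifier, where unlabeled samples are marked with -1 (the split and confidence threshold are illustrative assumptions):

    from sklearn.datasets import make_classification
    from sklearn.linear_model import LogisticRegression
    from sklearn.semi_supervised import SelfTrainingClassifier

    X, y = make_classification(n_samples=300, random_state=0)
    y_partial = y.copy()
    y_partial[100:] = -1   # treat two thirds of the labels as unknown
    model = SelfTrainingClassifier(LogisticRegression(), threshold=0.9)
    model.fit(X, y_partial)
    print(model.predict(X[:5]))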

2. Co-Training

Purpose: Use multiple views of data to train models.
Key Concept: Two models are trained on different feature views and label unlabeled examples for each other.

Ensemble Methods

1. Bagging (Bootstrap Aggregating)

Purpose: Reduce variance and prevent overfitting.
Key Concept: Train multiple models on different bootstrap samples and aggregate predictions.
Example: Random Forest.
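
A short scikit-learn sketch bagging decision trees (tree count and data are example choices):

    from sklearn.datasets import make_classification
    from sklearn.ensemble import BaggingClassifier
    from sklearn.tree import DecisionTreeClassifier

    X, y = make_classification(n_samples=300, random_state=0)
    # 50 trees, each fit on a bootstrap sample; predictions are majority-voted
    bag = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50, random_state=0)
    bag.fit(X, y)
    print(bag.score(X, y))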

2. Boosting

Purpose: Reduce bias and build strong predictive models.
Key Concept: Sequentially train models, each focusing on errors of the previous ones.
Examples: AdaBoost, Gradient Boosting, XGBoost.
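
For illustration, AdaBoost in scikit-learn (the estimator count is arbitrary; gradient boosting was sketched earlier in the GBM entry):

    from sklearn.datasets import make_classification
    from sklearn.ensemble import AdaBoostClassifier

    X, y = make_classification(n_samples=300, random_state=0)
    # Later learners give more weight to examples earlier ones misclassified
    boost = AdaBoostClassifier(n_estimators=100, random_state=0).fit(X, y)
    print(boost.score(X, y))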

3. Stacking

Purpose: Combine multiple models to improve performance.
Key Concept: Use a meta-model to aggregate predictions from base models.
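
A minimal scikit-learn sketch; the choice of base models and of logistic regression as the meta-model is illustrative:

    from sklearn.datasets import make_classification
    from sklearn.ensemble import RandomForestClassifier, StackingClassifier
    from sklearn.linear_model import LogisticRegression
    from sklearn.svm import SVC

    X, y = make_classification(n_samples=300, random_state=0)
    base_models = [('rf', RandomForestClassifier(random_state=0)), ('svc', SVC())]
    stack = StackingClassifier(estimators=base_models, final_estimator=LogisticRegression())
    stack.fit(X, y)   # the meta-model learns from the base models' predictions
    print(stack.score(X, y))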

Additional Algorithms and Techniques

Support Vector Regression (SVR)

Purpose: Regression using SVM principles.
Key Concept: Fits a function so that predictions lie within an epsilon-insensitive margin (tube) around the targets; errors inside the tube are ignored.
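
A brief scikit-learn sketch; the sine-shaped toy data and epsilon=0.1 are assumptions for illustration:

    import numpy as np
    from sklearn.svm import SVR

    X = np.linspace(0, 5, 50).reshape(-1, 1)
    y = np.sin(X).ravel()
    # epsilon sets the width of the tube within which errors are ignored
    svr = SVR(kernel='rbf', C=1.0, epsilon=0.1).fit(X, y)
    print(svr.predict([[2.5]]))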

Elastic Net

Purpose: Regularized regression combining L1 and L2 penalties.
Key Concept: Balances between feature selection and coefficient shrinkage.
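
A minimal scikit-learn sketch (the alpha and l1_ratio values are arbitrary examples):

    from sklearn.datasets import make_regression
    from sklearn.linear_model import ElasticNet

    X, y = make_regression(n_samples=200, n_features=20, noise=0.5, random_state=0)
    # alpha sets overall regularization strength; l1_ratio mixes the L1 and L2 penalties
    enet = ElasticNet(alpha=0.1, l1_ratio=0.5).fit(X, y)
    print((enet.coef_ != 0).sum(), 'non-zero coefficients')   # the L1 part drives sparsity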

Gaussian Mixture Models (GMM)

Purpose: Probabilistic clustering.
Key Concept: Assumes data is generated from a mixture of several Gaussian distributions.
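
A short scikit-learn sketch on blob data (three components is an illustrative choice matching the generated blobs):

    from sklearn.datasets import make_blobs
    from sklearn.mixture import GaussianMixture

    X, _ = make_blobs(n_samples=300, centers=3, random_state=0)
    gmm = GaussianMixture(n_components=3, random_state=0).fit(X)
    print(gmm.predict_proba(X[:2]))   # soft membership probabilities per Gaussian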

t-Distributed Stochastic Neighbor Embedding (t-SNE)

Purpose: Data visualization.
Key Concept: Reduces dimensions while preserving local structure.
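
A minimal scikit-learn sketch embedding the 64-dimensional digits data into 2-D (perplexity is left at a typical value purely as an example):

    from sklearn.datasets import load_digits
    from sklearn.manifold import TSNE

    X, _ = load_digits(return_X_y=True)
    X_2d = TSNE(n_components=2, perplexity=30, random_state=0).fit_transform(X)
    print(X_2d.shape)   # (1797, 2), ready for a scatter plot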

Hidden Markov Models (HMM)

Purpose: Model sequential data.
Key Concept: States are hidden and emit observable events.

Key Concepts and Terms

Overfitting: Model performs well on training data but poorly on unseen data.
Underfitting: Model is too simple to capture underlying patterns.
Bias-Variance Tradeoff: Balance between model complexity and generalization.
Cross-Validation: Technique to assess model performance by partitioning data.
Regularization: Techniques to prevent overfitting (e.g., L1, L2).
Feature Scaling: Standardizing features to improve model performance.
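
Tying two of the terms above together, a sketch that combines feature scaling and 5-fold cross-validation in a scikit-learn pipeline (the model choice is arbitrary):

    from sklearn.datasets import load_iris
    from sklearn.model_selection import cross_val_score
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import StandardScaler
    from sklearn.svm import SVC

    X, y = load_iris(return_X_y=True)
    pipeline = make_pipeline(StandardScaler(), SVC(C=1.0))
    print(cross_val_score(pipeline, X, y, cv=5).mean())   # average held-out accuracy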

Resources and Libraries

Python Libraries:

Scikit-learn: Comprehensive ML algorithms.
TensorFlow & Keras: Deep learning frameworks.
PyTorch: Flexible deep learning library.
XGBoost, LightGBM, CatBoost: Gradient boosting implementations.

Books:

"Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow" by Aurélien Géron
"Pattern Recognition and Machine Learning" by Christopher M. Bishop
"The Elements of Statistical Learning" by Trevor Hastie, Robert Tibshirani, and Jerome Friedman

Online Courses:

Coursera's Machine Learning by Andrew Ng
edX's MicroMasters in Statistics and Data Science
Udacity's Machine Learning Nanodegree
