GlobalLogic - Optimization Algorithms For Machine Learning
In this use case scenario, we explore how an optimized machine learning model can be used to predict employee attrition.
Introduction
Employers generally consider attrition a loss of valuable employees and talent; when employees
leave an organization, they take with them much-needed skills and qualifications they developed
during their tenure. There is no way for employers to know which employees will leave the
company, but a well-trained machine learning model can be used to predict attrition. We will
look at some of the optimization algorithms to improve the performance of the model.
Optimization is the most crucial part of machine learning algorithms. It begins with defining a
loss (or cost) function and ends with minimizing that function using an optimization algorithm.
These algorithms help us maximize or minimize an error function. The internal parameters of a model play a
very important role in efficiently and effectively training a model and producing accurate results.
This is why we use various optimization algorithms to update and calculate appropriate and
optimum values of a model’s parameters. This, in turn, improves our model’s learning process, as
well as its output.
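To make the idea concrete, here is a minimal sketch (not taken from this paper's code) of gradient descent minimizing a mean-squared-error loss for a simple linear model; the synthetic data, learning rate, and step count are illustrative assumptions.

    import numpy as np

    # Illustrative sketch: fit y = w*x + b by minimizing mean squared error
    # with plain gradient descent. Data, learning rate, and iteration count
    # are made up for demonstration.
    rng = np.random.default_rng(0)
    x = rng.normal(size=100)
    y = 3.0 * x + 2.0 + rng.normal(scale=0.1, size=100)  # synthetic targets

    w, b = 0.0, 0.0   # internal parameters to be optimized
    lr = 0.1          # learning rate (step size)

    for step in range(200):
        error = w * x + b - y
        loss = np.mean(error ** 2)        # the loss/cost function
        grad_w = 2 * np.mean(error * x)   # dLoss/dw
        grad_b = 2 * np.mean(error)       # dLoss/db
        w -= lr * grad_w                  # step against the gradient
        b -= lr * grad_b

    print(f"w={w:.3f}, b={b:.3f}, loss={loss:.5f}")  # expect w near 3, b near 2

Each update moves the parameters a small step in the direction that reduces the loss, which is exactly the process the optimization algorithms below refine.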
1) Dataset
Head out to *link* to read about all of the above topics in brief.
1) Batch Normalization
Batch normalization is a method used to normalize the inputs of each layer in order to improve
the performance and stability of neural networks. This also makes training more efficient.
We normalize the input layer by adjusting and scaling the activations so that they have a mean
of zero and a standard deviation of one (zero mean and unit variance). This allows each layer
to learn on a more stable distribution of inputs.
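As a minimal sketch of how this looks in practice (the framework choice, input shape, and layer sizes here are illustrative assumptions, not the paper's actual network), a Keras model might insert a BatchNormalization layer after each dense layer:

    from tensorflow import keras
    from tensorflow.keras import layers

    # Hypothetical network: five attrition-related input features and a
    # binary "leaves / stays" output. Each BatchNormalization layer rescales
    # the previous layer's activations to roughly zero mean and unit
    # variance per mini-batch.
    model = keras.Sequential([
        layers.Input(shape=(5,)),
        layers.Dense(16, activation="relu"),
        layers.BatchNormalization(),
        layers.Dense(16, activation="relu"),
        layers.BatchNormalization(),
        layers.Dense(1, activation="sigmoid"),
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])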
2) Grid-Search
Grid-searching is the process of scanning through candidate hyperparameter values to find the
best combination, and the parameters worth tuning vary depending on the type of model
utilized. Grid-searching does not apply to only
one model type. Grid-searching can be applied to calculate the best parameters
to use for any given model across machine learning. It works in an iterative way.
For some of the parameters associated with the model, we enter good probable
values and the grid-search iterates through each of them, compares the result
for each value, and then gives you the parameters best suited for your model.
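For instance, a grid search with scikit-learn's GridSearchCV might look like the following sketch; the random-forest model and the candidate values in the grid are illustrative assumptions, since the paper does not specify which model or parameters were tuned.

    from sklearn.ensemble import RandomForestClassifier
    from sklearn.model_selection import GridSearchCV

    # Candidate values for each hyperparameter; the search tries every
    # combination and keeps the best-scoring one.
    param_grid = {
        "n_estimators": [50, 100, 200],
        "max_depth": [3, 5, 10],
    }
    search = GridSearchCV(
        estimator=RandomForestClassifier(random_state=42),
        param_grid=param_grid,
        cv=5,                  # 5-fold cross-validation per combination
        scoring="accuracy",
    )
    # X and y stand in for the attrition features and labels loaded earlier:
    # search.fit(X, y)
    # print(search.best_params_, search.best_score_)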
3) Mini-Batch Gradient Descent
With this method, samples are selected randomly instead of using the whole data set for each
iteration or using the data in the order they appear in the training set. We adjust the
total number of samples from the dataset used to calculate the gradient for each iteration.
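Continuing the earlier gradient-descent sketch (again with made-up synthetic data, batch size, and learning rate), a mini-batch variant updates the parameters from a small random subset of the data at each step:

    import numpy as np

    # Mini-batch gradient descent: each update uses a random subset of the
    # data rather than the full dataset. Batch size and data are illustrative.
    rng = np.random.default_rng(0)
    x = rng.normal(size=1000)
    y = 3.0 * x + 2.0 + rng.normal(scale=0.1, size=1000)

    w, b, lr, batch_size = 0.0, 0.0, 0.1, 32

    for epoch in range(20):
        idx = rng.permutation(len(x))          # shuffle so batches are random
        for start in range(0, len(x), batch_size):
            batch = idx[start:start + batch_size]
            xb, yb = x[batch], y[batch]
            error = w * xb + b - yb
            w -= lr * 2 * np.mean(error * xb)  # gradient from this batch only
            b -= lr * 2 * np.mean(error)

    print(f"w={w:.3f}, b={b:.3f}")             # expect w near 3, b near 2

Smaller batches give noisier but cheaper updates; larger batches give smoother gradients at more cost per step.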
Click on the link to read about the methods in detail. Advantages and disadvantages are
provided for each, along with code snippets.
Conclusion
We implemented different models to predict attrition in a company, measured their accuracy,
and improved it with the optimization algorithms above. However, in reality we might have many
more data sets where optimization would be needed to arrive at an effective
model.
Finally, we have a working model that predicts which employees will leave the
company and which will stay, based on five input parameters, with an accuracy of
almost 98 percent.