Big Data Machine Learning Lab 4

This document describes a lab assignment on machine learning classification techniques. The lab contains 4 tasks involving support vector machines, neural networks, and convolutional neural networks applied to wine and handwritten digit datasets. Task 1 and 2 use SVMs and neural networks to classify wine samples. Task 3 and 4 compare regular neural networks to convolutional neural networks for classifying handwritten digits in the MNIST dataset. Instructions are provided on loading and preparing the datasets, designing and training the models, and evaluating performance.

Uploaded by

fahim.samady2001

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views

Big Data Machine Learning Lab 4

Uploaded by

fahim.samady2001

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

CS-345/M45 Lab Class 4 Release date: 14/03/2022

Total Marks: 4 Due date: 28/03/2022

Support Vector Machines, Neural Networks, and Convolutional

Neural Networks
This lab is about utilizing Support Vector Machines, Neural Networks, and Convolutional Neu-
ral Networks for classification. We will be looking at applications of the approaches to both the
previously seen Wine dataset, and also a new dataset: MNIST Hand-written digits.

There are 4 marked tasks in this lab; implementing SVMs, Neural Networks, and Convolu-
tional Neural Networks. Task 4.1 and 4.2 compare SVMs and neural networks for multi-class
prediction on multivariate data. Task 4.3 and 4.4 compare the use of neural networks and
convolutional neural networks on the image classification task.

For tasks 4.1 and 4.2, we will be using the Wine dataset. You should re-use the dataset pro-
vided in the previous lab. Links to the dataset are provided on the Canvas page. For tasks
4.3 and 4.4, we will use the MNIST hand-written digit recognition dataset. We will download
this set using Tensorflow. A description of the data, and how to download it, are provided below.

In order to complete this lab you will need to install the following additional Python packages
into your virtual environment: tensorflow

Note: Building and developing deep learning models is a big step up from just calling a
constructor seen in the previous few labs. Building these neural networks requires understanding
how the layers fit together, understanding the shape of the data, the outputs, and the purpose
of each layer type. There are also additional considerations to take into account; including the
role of optimisers, activation functions, layer hyperparameters, and metrics. To this end, this
lab is a bit more involved. Watch the tutorial video, check out the Tensorflow API and really
strive to understand what is happening, and what can be tweaked.
After describing the lab tasks, there is also a section at the end of this lab sheet which describes
how the Tensorflow/Keras framework can be used. It is by no means exhaustive, but should
give you a head-start in completing the lab.
You are also provided with an example Jupyter notebook which implements a fully-connected
neural network model on the Fisher Iris dataset. This notebook looks at defining a network of
Dense layers, with 2 hidden layers and an output layer. It also defines the hyperparameters,
optimisers and metrics used to train the model. It then trains a model and uses it to predict
on the test set. It then plots the loss and accuracy curves, to allow us to gain more insight into
our model’s training.

14
Tasks Using the Wine Dataset
The first two tasks look to classify a given sample into one of three different classes. The
wines all come from the same region of Italy, but have been produced by one of three different
cultivators. The data includes a 13-dimensional multivariate chemical analysis of the wine,
including things like the acidity and alcohol content. The labels are a numerical ID of which
cultivar created the wine (0, 1, 2). The following two tasks will utilise this 13-dimensional space
to train models which predict the label.

Task 4.1 – Multiclass SVM for Wine Data

This first task involves using a Support Vector Machine to predict classes labels on the Wine
dataset.
Your task here is to create, train and predict using an instance of the sklearn.svm.SVC object.
sklearn has many Support Vector Machine algorithms implemented, however we are interested
in the classification task here, so we will use the SVC object.
Load the full Wine dataset and divide it into a training and testing set.
Use the sklearn.preprocessing.StandardScaler class to standardise the data. First,
fit the StandardScaler to the training data, and then apply to both the training and testing
data using the transform() method.
Create and train a multiclass SVM on the training set by creating an instance of the
sklearn.svm.SVC class and the calling its fit() method.
Predict labels for the testing set and report the accuracy of your model. To do this, you
can use the model’s score() method, passing in the testing data and labels.
Visualise the test data using a scatter plot. Colour the markers with the ground truth
labels from the dataset. In an adjacent scatter plot, visualise the test data, but this time
colour the markers by predicted class label. You should end up with a plot similar to the
following:

Figure 4: Plot of Ground Truth vs. SVM predictions.

Go back and explore the different model hyper-parameters (i.e. cost, kernel type etc.),
see if you can improve the accuracy.

15
Task 4.2 – Neural Network for Wine Data
This task looks to apply the Tensorflow framework, and more specifically the Keras submodule
to create a deep learning model, showing how layers can be connected to create the model.
The Tensorflow API can be found at https://www.tensorflow.org/api_docs/python/tf and
there are further, deeper tutorials into Tensorflow and Keras here.
To create our neural network we will create a tensorflow.keras.Sequential model. A helper
notebook is provided to give you a crash-course on the framework.
Use the same standardised training and testing set from Task 4.1
Create and train a Tensorflow Fully Connected Neural Network on the training set. See
the helper notebook and end of this handout for more guidance.
Predict labels for the testing set and report the accuracy of your model on the testing set.
Visualise the test data using a scatter plot. Colour the markers with the ground truth
labels. In an adjacent scatter plot, visualise the test data, but this time colour the markers
by the predicted class label.
Go back and explore the model hyperparameters; for example try changing network’s
layers, or the optimiser. You may not see change in the result, but you can also check the
training curves to observe any impact. Does it overfit more? Is it converging faster?
You should end up with a few plots, as follows:

Figure 5: Plot of Ground Truth vs. Neural Network predictions.

Figure 6: Plot of Neural Network training curves.

16
Task 4.3 – Neural Network for Digit Recognition
Load in the MNIST dataset (see below).
To use a fully connected neural network, you will need to first flatten the data so that
is able to be passed into a Dense network. To do this, use np.reshape() to reshape the
training data into 60000-by-784, and the testing data into 10000-by-784.
Normalise our data by dividing it by 255 (the maximum value in the original data).
Create and train a Tensorflow Fully Connected Neural Network on the training set. See
the helper notebook and end of this handout for more guidance.
Predict labels for the testing set and report the accuracy of your model on the testing set.
Plot your model’s training curves.
Go back and explore the model hyperparameters; for example try changing network’s
layers, or the optimiser.

Task 4.4 – Convolutional Neural Network for Digit Recognition

Use the MNIST data as loaded before, with its original shape of S-H-W.
To use a convolutional neural network, you will need to first expand the data so that it
also has a channel dimension. As our data is grayscale, we only need add an additional
axis in the last dimension of our data. To do this, use np.expand dims() to make training
data 60000-28-28-1, and the testing data 10000-28-28-1.
Normalise our data by dividing it by 255 (the maximum value in the original data).
Create and train a Tensorflow Convolutional Neural Network on the training set. To do
this, you will need to explore the API to find out about Conv2D and Pooling layers within
tensorflow.keras.layers.
Predict labels for the testing set and report the accuracy of your model on the testing set.
Plot your model’s training curves.
Go back and explore the model hyperparameters; for example try changing network’s
layers, or the optimiser.

17
Tasks Using the MNIST Dataset
The tasks here look to classify a given sample into one of 10 different classes. The data is a
28-by-28 grayscale image of a hand-written numerical digit, 0-9, as in Figure 1:

Figure 7: matplotlib plot of an example image from the MNIST dataset

In order to utilise the MNIST dataset, we must first load it into our Jupyter notebook. You can
use Tensorflow’s built-in datasets to do this; loading MNIST requires you to call the following:
(x train, y train), (x test, y test) = tf.keras.datasets.mnist.load data()
This will load the MNIST dataset into 4 variables, x train, y train, x test, and y test.
These correspond to the data (x) and targets (y) for the training and testing sets respectively.
You can then verify the shape of the various variables loaded in:

Figure 8: Shapes of the MNIST dataset loaded from keras.datasets

Tensorflow and the keras.Sequential model class

We can utilise the keras submodule within Tensorflow to create and train our neural network
model. Keras is a part of the Tensorflow framework which is designed to make creating, training,
and deploying deep learning models relatively straightforward. It contains a number of common
implemented layer types, which can be passed into the constructor of the Sequential class.

18
The Sequential object will be a model in which the provided layers are applied one after the
other to the input, producing an output. The class, and the Keras framework, provide an
underlying computation graph which handles the feedforward and backpropagation required
to train the network. Keras also provides the usual fit, predict, and evaluate methods to
allow us to train our model and provide inference on a new observation. More detail on the
Sequential model can be found at the Tensorflow API at https://www.tensorflow.org/api_
docs/python/tf/keras/Sequential
To build a basic fully-connected neural network, we will create a Sequential model object.
The following example creates a fully-connected neural network with 2 hidden layers, and an
output layer. The first hidden layer is a Dense layer with 4 neurons, the second hidden layer
has 10 neurons, and the output layer has 100 neurons. Each layer has an activation function
applied to it once the weights and bias have been applied, the hidden layers have Rectified
Linear activation applied, whilst the output layer has a softmax activation (to place the values
into probability space).

Figure 9: A Sequential model, with 2 hidden layers, and a output layer with 100 outputs.

What is happening in Figure 3? First we create a Sequential model, passing instances of tf.keras
Dense layers to the constructor as a list. We then compile our model with model.compile so
that it can be trained. In the compile method we pass in details regarding which optimiser,
loss, and training metric we want to use to fit our model to the data. There’s a few things to
unpack here:
• tf.keras.optimizers.SGD() This is an instance of a Stochastic Gradient Descent optimiser,
one of the simplest optimisers we can use to train our model. Other optimisers include
Adam(), and RMSProp().
• tf.keras.losses.SparseCategoricalCrossentropy() This is an instance of the categorical
crossentropy loss metric, which works for targets which are labels (rather than one-hot
encoded). Our targets are labels (i.e. IDs are 0-9).
• tf.keras.metrics.SparseCategoricalAccuracy() This is an instance of the categorical crossen-
tropy metric for checking model accuracy. Again, this works for targets which are labels,
like we have loaded in.
Once compiled, our model can be trained using the fit() method. Checking the API, at https:
//www.tensorflow.org/api_docs/python/tf/keras/Sequential#fit, we can see that the fit
method can take in a number of parameters. The key ones are as follows:
• x The data, to be passed through the network in order to train.
• y The targets, the ground truth labels for the data in x.
• epochs The number of passes through the dataset to complete when training the model.

19
• validation split A float between 0.0 and 1.0. This will split x and y into a training and
validation set during training. This will let us see if over-fitting is occurring.

Figure 10: A Sequential model being fit to some data.

Challenge Task 4.5

Some questions to consider:
1. What makes a neural network a “Deep Learning” model?
2. How do I make my neural network deeper? How do I make it wider?
3. How do I train my models for longer?
4. Why does running the methods numerous times result in different accuracy rates?
5. What hyperparameters are available to our models? What happens when we alter the
penalty in the SVM or the optimisation strategy in the neural network?

Computer Vision With Keras
No ratings yet
Computer Vision With Keras
67 pages
Cours 3 - Custom Models and Training With TensorFlow
No ratings yet
Cours 3 - Custom Models and Training With TensorFlow
36 pages
DL Mannual For Reference
No ratings yet
DL Mannual For Reference
58 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
DSE_3141_Deep_Learning_Lab_Manual_2024_Week4
No ratings yet
DSE_3141_Deep_Learning_Lab_Manual_2024_Week4
14 pages
Project Documentation
No ratings yet
Project Documentation
24 pages
106106213
No ratings yet
106106213
637 pages
Cad and Dog 2
No ratings yet
Cad and Dog 2
5 pages
Deep Learning Lab Practicals
No ratings yet
Deep Learning Lab Practicals
24 pages
Image Classification using MNIST Dataset
No ratings yet
Image Classification using MNIST Dataset
28 pages
Keras-tensorflow-IT Haarlem 2023
No ratings yet
Keras-tensorflow-IT Haarlem 2023
35 pages
Deep Learning Record
No ratings yet
Deep Learning Record
70 pages
Experiment 3
No ratings yet
Experiment 3
5 pages
Deep Learning for Vision Lab Manual 2024
100% (1)
Deep Learning for Vision Lab Manual 2024
25 pages
Chapter04 - Getting Started With Neural Networks
No ratings yet
Chapter04 - Getting Started With Neural Networks
9 pages
DL Lab-III-II
No ratings yet
DL Lab-III-II
98 pages
DL Lab-final
No ratings yet
DL Lab-final
22 pages
Cad and Dog
No ratings yet
Cad and Dog
5 pages
dl lab1
No ratings yet
dl lab1
15 pages
Deep Learning lab with Tensorflow (2)
No ratings yet
Deep Learning lab with Tensorflow (2)
84 pages
DL PRACTICAL FILE
No ratings yet
DL PRACTICAL FILE
58 pages
DL INTERNAL
No ratings yet
DL INTERNAL
12 pages
Rec Ex 11
No ratings yet
Rec Ex 11
13 pages
cat_dog_classification_CNN_Model
No ratings yet
cat_dog_classification_CNN_Model
13 pages
Convolutional Neural Networks: Objectives
No ratings yet
Convolutional Neural Networks: Objectives
10 pages
Dlv Lab Manual Print
No ratings yet
Dlv Lab Manual Print
29 pages
ML Lab Session 05 - CNN Implementation
No ratings yet
ML Lab Session 05 - CNN Implementation
4 pages
DEEP LEARNING EXPERIMENTS
No ratings yet
DEEP LEARNING EXPERIMENTS
42 pages
Deep Learning Lab With Output
No ratings yet
Deep Learning Lab With Output
12 pages
CNN with TensorFlow and Keras
No ratings yet
CNN with TensorFlow and Keras
11 pages
Assignment No 2
No ratings yet
Assignment No 2
8 pages
How to Develop a CNN for MNIST Handwritten Digit Classification
No ratings yet
How to Develop a CNN for MNIST Handwritten Digit Classification
43 pages
NNDL Lab Manual
No ratings yet
NNDL Lab Manual
39 pages
CCS355-Neural networks and deep learning_____Assignment 1
No ratings yet
CCS355-Neural networks and deep learning_____Assignment 1
15 pages
LP V GRPB 2b
No ratings yet
LP V GRPB 2b
8 pages
dl_22Q71A4206
No ratings yet
dl_22Q71A4206
65 pages
Dl Lab Manual
No ratings yet
Dl Lab Manual
18 pages
24mcs1025-ex2-part-c-wine-dataset
No ratings yet
24mcs1025-ex2-part-c-wine-dataset
3 pages
CCS355-Neural networks and deep learning__Assignment 1
No ratings yet
CCS355-Neural networks and deep learning__Assignment 1
15 pages
ML Ass2
No ratings yet
ML Ass2
8 pages
MVS_Expt8 Object Detection and Reconstruction Using CNN
No ratings yet
MVS_Expt8 Object Detection and Reconstruction Using CNN
5 pages
UNIT_I CHP_5
No ratings yet
UNIT_I CHP_5
26 pages
Aditya Joshi 23252595 Assign 5
No ratings yet
Aditya Joshi 23252595 Assign 5
7 pages
CI Keras
No ratings yet
CI Keras
22 pages
DL Programs
No ratings yet
DL Programs
12 pages
Deep Learning
No ratings yet
Deep Learning
46 pages
Deep Learning With Python
100% (4)
Deep Learning With Python
396 pages
Assignment 02# - Machine Learning 2023
No ratings yet
Assignment 02# - Machine Learning 2023
8 pages
Exercise 2 Building Convolution Neural Network
No ratings yet
Exercise 2 Building Convolution Neural Network
15 pages
A First Look On Nueral Network
No ratings yet
A First Look On Nueral Network
8 pages
Image Classification With Convolutional Neural Networks: Plotting
No ratings yet
Image Classification With Convolutional Neural Networks: Plotting
16 pages
dl lab_merged (2)
No ratings yet
dl lab_merged (2)
60 pages
Tensor Flow 2
No ratings yet
Tensor Flow 2
3 pages
Deep Learning and Machine Learning: Lab Explanation
No ratings yet
Deep Learning and Machine Learning: Lab Explanation
34 pages
dlweek7
No ratings yet
dlweek7
9 pages
DL7 2
No ratings yet
DL7 2
11 pages
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
No ratings yet
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
8 pages
ASNM Program Explain
No ratings yet
ASNM Program Explain
4 pages
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
From Everand
DEEP LEARNING TECHNIQUES: CLUSTER ANALYSIS and PATTERN RECOGNITION with NEURAL NETWORKS. Examples with MATLAB
César Pérez López
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
AI & ML Question Bank
No ratings yet
AI & ML Question Bank
10 pages
Deep Learning With PyTorch 1
No ratings yet
Deep Learning With PyTorch 1
1 page
Presentation Of: The Role of Artificial Intelligence in Architectural Design, Conversation With Designers and Researchers - S.Arch 2020, Tokyo
100% (1)
Presentation Of: The Role of Artificial Intelligence in Architectural Design, Conversation With Designers and Researchers - S.Arch 2020, Tokyo
16 pages
Temporal Difference Learning
No ratings yet
Temporal Difference Learning
17 pages
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
No ratings yet
Assignment - Week 6 (Neural Networks) Type of Question: MCQ/MSQ
4 pages
1.1 - Computational Intelligence PDF
No ratings yet
1.1 - Computational Intelligence PDF
31 pages
Convolutional Neural Networks (Image Recognition) Part - II: Dr. Syed M. Usman
No ratings yet
Convolutional Neural Networks (Image Recognition) Part - II: Dr. Syed M. Usman
75 pages
Human Emotion Detectionusing Machine Learning Techniques
No ratings yet
Human Emotion Detectionusing Machine Learning Techniques
8 pages
C V
No ratings yet
C V
2 pages
Dfy Chatbot Dev Using Python
No ratings yet
Dfy Chatbot Dev Using Python
4 pages
Curriculum CVDL Master Program Updated
No ratings yet
Curriculum CVDL Master Program Updated
42 pages
Texture Classification Based On Symbolic Data Analysis: Abstract
No ratings yet
Texture Classification Based On Symbolic Data Analysis: Abstract
6 pages
Cs 503 B Pattern Recognition Jun 2020
No ratings yet
Cs 503 B Pattern Recognition Jun 2020
3 pages
Detection Tracking and Classification of Aircraft and Drones in 2019
No ratings yet
Detection Tracking and Classification of Aircraft and Drones in 2019
36 pages
Full Download (Ebook) Generative Artificial Intelligence: Exploring the Power and Potential of Generative AI by Shivam R Solanki, Drupad K Khublani ISBN 9798868804021, 8868804026 PDF DOCX
No ratings yet
Full Download (Ebook) Generative Artificial Intelligence: Exploring the Power and Potential of Generative AI by Shivam R Solanki, Drupad K Khublani ISBN 9798868804021, 8868804026 PDF DOCX
66 pages
Machine Learning and Data Science With Python
No ratings yet
Machine Learning and Data Science With Python
7 pages
Reading Comprehension - Magazine Article " Artificial Intelligence May Doom The Human Race Within A Century, Oxford Professor Says"
No ratings yet
Reading Comprehension - Magazine Article " Artificial Intelligence May Doom The Human Race Within A Century, Oxford Professor Says"
5 pages
LLM Model
No ratings yet
LLM Model
43 pages
Three Scenarios of Continual Learning
No ratings yet
Three Scenarios of Continual Learning
18 pages
Evaluation of GPT and BERT-based Models On Identif
No ratings yet
Evaluation of GPT and BERT-based Models On Identif
25 pages
DL UNIT 1
No ratings yet
DL UNIT 1
19 pages
Classification Applications With Deep Learning and Machine Learning Technologies
100% (1)
Classification Applications With Deep Learning and Machine Learning Technologies
287 pages
Dap_Latex(ENG)
No ratings yet
Dap_Latex(ENG)
11 pages
Micro-Report-format 5 (1)
No ratings yet
Micro-Report-format 5 (1)
13 pages
Data Science Lab-KTU
No ratings yet
Data Science Lab-KTU
5 pages
Unit-4 AML (1. Basics and K-NN)
No ratings yet
Unit-4 AML (1. Basics and K-NN)
25 pages
AI & ML Syllabus
No ratings yet
AI & ML Syllabus
2 pages
Leancontext: Cost-Efficient Domain-Specific Question Answering Using Llms
No ratings yet
Leancontext: Cost-Efficient Domain-Specific Question Answering Using Llms
8 pages
Data Science - Presentation
No ratings yet
Data Science - Presentation
15 pages
Phase 1 PPT Digit Recognition
No ratings yet
Phase 1 PPT Digit Recognition
8 pages