0% found this document useful (0 votes)

8 views

Assignment DL

Uploaded by

venkatakalyan2nd

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Assignment DL

Uploaded by

venkatakalyan2nd

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

DEPARTMENT OF CSE (AI/ML)

Name: R. ATULIYA
USN:21BTRCL131
Subject: DEEP LEARNING Semester: 6th SEM
Subject Code: 21CS6AM11 Total Hours: 45
Credits: 03 Hours per week: 03
Faculty Name: Prof. Sahana Shetty Academic Year: 2022-23
Due Date: 01.03.24

Assignment Questions 1– 18.03.2024

Sl. No Question CO’s Bloo

ms
Level
1. 1. Company named “SSS” products expensive and high quality CO1 L3
product which has 2 features which are measured as X1
curvature and X2 diameter. Class represents the quality
control result marked as 0 Failed and 1 as Passed from the below
mentioned table. Solve the problem using:
3.52a. Maximum Likelihood Estimation considering new
curvature value to be 2.33 and Diameter to be 5.5.

b. Discrete Distribution (Bernoulli distribution or Multinomial

distribution)
c. Continuous Distribution (Gaussian distribution)

a. Maximum Likelihood Estimation (MLE):

After analyzing the data and assuming it conforms to a distribution
ANSWER we can work out the mean and variance for each category (Passed
and Failed) using the given data. Subsequently we can leverage
these parameters to estimate the probability of the data point falling
into each category.
import numpy as np

data = np.array([
[0, 2.93, 6.63],
[0, 2.53, 7.79],
[0, 3.57, 5.65],
[0, 3.16, 5.47],
[1, 2.58, 4.46],
[1, 2.16, 6.22],
[1, 3.27, 3.52]
])

X = data[:, 1:] # Features (curvature and diameter)

y = data[:, 0] # Labels (quality control result)

# Separating data based on labels

passed = X[y == 1]
failed = X[y == 0]

mean_passed = np.mean(passed, axis=0)

var_passed = np.var(passed, axis=0)
mean_failed = np.mean(failed, axis=0)
var_failed = np.var(failed, axis=0)

new_point = np.array([2.33, 5.5])

likelihood_passed = np.prod(1 / np.sqrt(2 * np.pi * var_passed) *

np.exp(-((new_point - mean_passed) ** 2) / (2 * var_passed)))
likelihood_failed = np.prod(1 / np.sqrt(2 * np.pi * var_failed) *
np.exp(-((new_point - mean_failed) ** 2) / (2 * var_failed)))

print("Likelihood of passing:", likelihood_passed)

print("Likelihood of failing:", likelihood_failed)

if likelihood_passed > likelihood_failed:

print("Prediction: Passed")
else:
print("Prediction: Failed")

b. Categorical Distribution (Bernoulli distribution or

Multinomial distribution):
In the case of Bernoulli distribution you would evaluate each
characteristic independently. Model the likelihood of passing based
on each feature.
For distribution you would assess both features and model the
combined probability of passing considering all features.

c. Continuous Distribution ( distribution):

When employing a distribution approach we make an assumption
that the data adheres to a distribution pattern and determine the
probability of the new data point belonging to each category based
on the mean and variance of features for each category.
2. Consider you are given a work of building a model to predict CO1 L2
housing prices based on various features such as location, square
footage, number of bedrooms, and neighborhood amenities.
You've collected a dataset containing information on thousands
of houses, including their features and sale prices. For the given
scenario specify over fit, underfit and balanced fit in a model.

ANSWER
1. Overfitting:

Description: Overfitting occurs when a model learns the training

data too well, capturing noise or random fluctuations that are not
representative of the true underlying relationship between the
features and the target variable.

Example: In our scenario, an overfitted model might perfectly

predict the sale prices of the houses in the training dataset but
perform poorly on new, unseen data. For instance, it might capture
very specific details about certain houses that are unique to the
training data but not generalizable to new houses.

Solution: To address overfitting, we can use techniques like cross-

validation, regularization, or reducing the complexity of the model
(e.g., using fewer features or adding constraints).

2. Underfitting:

Description: Underfitting occurs when a model is too simple to

capture the underlying structure of the data, leading to poor
performance on both the training and test datasets.

Example: In our scenario, an underfitted model might fail to

capture important patterns or relationships between the housing
features and prices. It may produce inaccurate predictions even on
the training data.

Solution: To address underfitting, we can try using more complex

models, adding more relevant features, or increasing the model's
capacity (e.g., increasing the number of layers in a neural network).

3. Balanced Fit:

Description: A balanced fit occurs when the model captures the

underlying patterns in the data without overfitting or underfitting. It
generalizes well to new, unseen data, producing accurate
predictions.

Example: In our scenario, a balanced fit model would accurately

predict housing prices based on features like location, square
footage, number of bedrooms, and neighborhood amenities. It
demonstrates strong performance on both the training and test sets.
Solution: Achieving a balanced fit often requires experimentation
with different models, feature engineering techniques, and
hyperparameter tuning. It's about finding the right level of
complexity that captures the essential patterns in the data without
memorizing noise or being too simplistic.

3. Mathematically derive Backpropagation. CO1 L2

ANSWER The backpropagation algorithm is a key component in training

neural networks. Here, I'll outline the mathematical derivation of
backpropagation.

Let's consider a simple neural network with one hidden layer.

We'll denote:

Forward Pass:

For each hidden neuron j, compute the weighted sum of inputs:

Apply the activation function to obtain the output of the hidden

neuron:

Then, for the output neuron, compute the weighted sum of

hidden neuron outputs:

And apply an activation function to obtain the predicted output:

Loss Function:

Define a loss function, typically the mean squared error:

Backward Pass:

Compute the gradient of the loss function with respect to the

weights:

Gradient Descent:

Update the weights using gradient descent:

Where η is the learning rate.

Repeat:

Repeat the forward and backward pass steps for multiple

iterations until convergence.

This derivation demonstrates how backpropagation computes the

gradients of the loss function with respect to the weights of the
neural network, enabling us to update the weights in the direction
that minimizes the loss function.
DEPARTMENT OF CSE (AI/ML)

Name: R. ATULIYA
USN: 21BTRCL131
Subject: Deep learning Semester: 6th SEM
Subject Code: 21CS6AM11 Total Hours: 45
Credits: 03 Hours per week: 03
Faculty Name: Ms. Sahana Shetty Academic Year: 2022-23
Due Date: 01.03.24

Assignment Questions 2– 18.03.2024

Sl. No Question CO’s Bloo

ms
Level
1. Given any dataset of your choice give an application of Deep CO2 L3
Learning. (Apart from the once covered in lab)

ANSWER Deep Learning, has significant applications in the world of

agriculture. considering the application of deep learning in
precision agriculture, specifically in crop disease detection

Application: Plant disease Identification using Deep Learning.

Dataset: Let us assume that you have a dataset with pictures of both
healthy and sick crops, each characterizing the specific type of
disease in the picture.

1. Problem Statement : Identification and management of plant

diseases on time is one of the major challenges for farmers who can
loose their crops in case it will be too late. These challenges can
ultimately result in crop yield losses and economic impact. Deep
learning has the ability to be applied to the automated performance
of disease detection, helping an earlier diagnosis and the
intervention.

2. Approach:
- Assemble and come up with a set of images that simulate different
disease conditions and healthy crop images.
- Make available a deep learning model, like a convolutional neural
network (CNN), to segment images into either healthy or ill classes.
- Make model suitable on given dataset by using tuning hyper
parameters and architecture in order to improve performance.
- Make the model a valid one using an independent dataset or cross-
validation techniques to guarantee that it does not overfit.

3. Deployment:
- Deploy the model into a crop monitoring tools where the machine
can analyze the images from drones, satellites, or smartphones in
real time.
- Farmers will be able to spot early symptoms of a disease in their
crops before a substantial loss occurs, allowing them to take timely
measures such as spraying pesticides or doing crop rotation.

4. Benefits:
- Early Detection: The capability of deep learning facilitates an
early detection of farm diseases, which, in turn, enables farmers to
take timely measures in order to confine the damages to minimum
varieties only.
- Precision Agriculture: The method through which the problematic
regions using within the field are accurately identified would help
ensure that the intervention is targeted as much as possible with
minimum amounts of pesticides being sprayed making the
environmental impact minimal.
- Increased Yield: In time the dilemma of disease management can
be handled successfully, and both crop health and yield gain is
ensured, then food security and economic sustainability are
achieved.

5. Challenges:
- Data Quality: Including more varieties in the data sets and quality
in it comes along with the goal of obtaining more Robust model
that be adapted for new environment and disease types.
- Interpretability: The deep learning models, being the emphasis on
the latter of of CNNs, are usually perceived as black boxes, making
very much more difficult to get behind probable reasoning that
leads to their predictions. The utility of attention mechanisms, for
example, finds the balance between conceptual understanding and
practical handling of various situations.

Through the use of deep learning techniques in the area of crop

disease detection, farmers will be enabled to stand out among
precision agriculturists and the whole of sustainable food
production will be topped in the line of global food security.

2. Given any dataset of your choice give an application of Deep CO2 L2

Reinforcement learning.

ANSWER Take the instance of autonomy planning with means of Deep

Reinforcement Learning (DRL).

Application: Autonomous Driving

Dataset: We can use a dataset generated from simulations or real-

world driving scenarios. This dataset would contain information
such as images from onboard cameras, lidar data, GPS coordinates,
vehicle speed, steering angles, throttle and brake inputs, and
potentially other environmental factors like traffic conditions and
weather.

Problem Statement: The goal is to train an autonomous driving

agent that can navigate through various traffic scenarios safely and
efficiently, following traffic rules and reaching its destination in a
timely manner.

Deep Reinforcement Learning Approach:

1. State Representation: The raw input data (images, lidar data, etc.)
would be preprocessed and used to represent the state of the
environment. This could involve techniques such as feature
extraction, dimensionality reduction, and normalization.

2. Action Space: The agent's actions would consist of steering

angles, throttle, and brake inputs, which it can adjust to control the
vehicle.

3. Reward Function: Define a reward function that incentivizes safe

and efficient driving behavior. For example, the agent receives
positive rewards for staying in its lane, obeying traffic signals,
avoiding collisions, and making progress towards the destination.
Negative rewards are given for violations like speeding, collisions,
and erratic driving behavior.

4. Training: Train the agent using deep reinforcement learning

algorithms such as Deep Q-Networks (DQN), Proximal Policy
Optimization (PPO), or Deep Deterministic Policy Gradient
(DDPG). The agent learns by interacting with the environment,
observing states, taking actions, and receiving rewards. Through
trial and error, it learns a policy that maximizes expected
cumulative rewards over time.

5. Evaluation: Evaluate the trained agent on unseen driving

scenarios, both in simulation and potentially in real-world testing.
Assess its performance in terms of safety, efficiency, and adherence
to traffic rules.

By applying deep reinforcement learning to autonomous driving,

we aim to develop agents capable of navigating complex and
dynamic environments, ultimately contributing to the advancement
of self-driving technology.

3. You are tasked with developing a deep learning model for facial CO2 L3
expression recognition. Given images of faces, the model needs
to classify the facial expressions into one of several categories
such as happiness, sadness, anger, etc. Discuss the potential
advantages and limitations of each variant in the context of
optimizing the model's performance for accurately classifying
facial expressions. (all the types discussed in class)
ANSWER
1. Convolutional Neural Networks (CNNs):

Advantages:
- Visual networks, CNNs, are predisposed for processing image-
oriented tasks, such as facial recognition (of emotions), thanks to
their capability of learning hierarchical features straight from the
raw pixel data.
- They can observe and capture local spatial patterns of facial
geometry which contain the features of eyes, the shape of nose
and mouth etc and their variations that are very specific for
recognizing facial expressions.
- CNNs have capabilities of accepting the input images of every
size and every angle making them potentially suited to different
datasets of facial expressions.

Limitations:
- The CNNs might need a huge amount of narrated data for
training to provide them a possibility of complexity in facial
expressions that can be difficult and costly to achieve.
- They might be not able to convert the new forms or variation of
expression in lighting, pose and face appearance in sample data.
- Occasionally unneeded computational power and time may be
consumed if CNNs are taught from the beginning to recognize
facial expressions.

2. Recurrent Neural Networks (RNNs)

Advantages:
- Mostly, RNNs, especially the LSTM versions, have capacity to
track the correlations of varying facial expressions in temporal
sequence, and therefore grasp the dynamic expressions of the
images extending over multiple frames.
- They are good in dealing with sequential data with just a
scratch and make memory representations of previous sequences
of facial landmarks or video frames.
- The RNNs, being the best choice to face this kind of situations
where faces develop from the start till the end like in videos or
real life feeds.

Limitations:
- RNNs could be weak in capturing long-range dependencies, or
even losing small details in facial expressions through the long
time, this might result in the error called Exploding or Vanishing
gradients.
- Hyperparameter fine-tuning is necessary for their stability; this
may involve adjusting the learning rate, sequence length, and
other parameters to prevent training overfitting and enhance
training convergence.
- Running RNNs on multi-scale datasets will incur a
computational cost with increased time required for process-
learning, even when you are only dealing with high-resolution
videos.

4. Imagine the company has a vast dataset containing information CO2 L3

about customer interactions, such as browsing history, purchase
behavior, and product reviews. Your task is to implement a
recommendation system using Restricted Boltzmann Machines
(RBMs). Describe breifely steps involved in training the RBM
considering the given dataset.

ANSWER
The steps involved in training a Restricted Boltzmann Machine
(RBM) for a recommendation system using the given dataset:The
steps involved in training a Restricted Boltzmann Machine (RBM)
for a recommendation system using the given dataset:

1. Data Preprocessing:
- Change the data-set into a model's choice of training the RBM.
- Use one dimension vector to demonstrate the customer
interactions (e.g., browsing history or purchase behavior ) and
represent it into a binary vector where each element shows the
presence or absence of a specific item or activity.
- Normalization or scaling of the data is required if the data behave
differently, such as they are non-uniform, for the sake of consistent
input scale of the RBM.

2. Initialization:
- The parameters of a Boltzmann machine should be initialized,
including the synaptic weights W connecting the visible and
hidden layers, the visible layer biases a and the hidden layer biases
- Specify the parameters with random initialization or apply
techniques like Xavier or He initialization that set initial values to
calculate the initial values.

3.Training:
- Get held the CD (CD) or PCD (PCD), respectively, to train the
RBM.
- Pick out a chunk of customer interaction data through sampling
from the dataset.
- Execute Gibbs sampling to update the hidden units using the
visible units' values; the same logic should be applied to the visible
units, updating their values with the help of those from the hidden
units.
- Calculate the gradients of the RBM parameters utilizing the
customized function of the contrastive divergence algorithm.
- Reconfigure the RBM with gradient descent method or any its
alternative (example: stochastic gradient descent, Adam) by
altering weights and biases.

4. Repeat:
- Keep modifying strategy for multiple epochs until meeting the
convergence criteria.
- Assess the accuracy of training by conducting an examination of
reconstruction error or other indicators of the performance metrics
on a validation dataset.
5. Model Evaluation:
- Assess the trained RBM model's performance using a validation
set on a level of reaching to the learn's accuracy.
- Use either accuracy, precision, and recall as a measure of the
model performance or mean squared error (MSE) as a measure of
the error between the predicted target value and the actual observed
target value.
- Determine and compare the RBM's performance to another
recommendation method or a baseline method to determine
whether it is effective.

6. Fine-tuning and Hyperparameter Tuning

- Seek to adjust the RBM parameters to properly set the learning
rate, batch size, the number of hidden units and training epochs in
order to enhance the performance.
- Try variants of architectures and regularization methods that will
prevent the model from overfitting and increase the generalization
over the newly acquired knowledge instead.

7. Deployment:
- The RBM model should be trained and evaluated until its
performance is good enough to proceed, then, integrate it into a
production environment to serve individualized recommendations
to customers.
- Implement the RBM inside the business recommendation system
framework, which gives ability in processing customers' real-time
interactions and providing relevant content.

Implement the followings, and you will be able to teach an

Restricted Boltzmann Machine (RBM) for a recommendation using
the given dataset. The model will be able to provide personalized
recommendations after analysis of customers' interactions.

5. What is the capacity of Hopfield network calculated. CO2 L2

The storage capacity, most often abbreviated as “H” for Hopfield

ANSWER
network, stands for the number of patterns or memories which, with
a certain degree of reliability, are able to be stored and later
retrieved. The figure of a Hopfield network capacity passes through
some considerations - the number of the network neurons, the
pattern dependencies, and the network dynamics.

Theoretical Analysis:
This upper bound on the capacity of an N-neuron Hopfield
network, theoretically, is around 0.138N, which is J. J. Hopfield’s
result according to the seminal work on associative memory. This
is analogous to saying that jamming has plentiful patterns stored
within the network network which they resemble as independent
and random. In other scenario, it means that there is a maximum
number of patterns in the network with precise recalling.
Calculation based on Overlap:
The comparison method with the Hopfield network's capacity can
be also assessed by analyzing the network’s ability to learn by its
overlapping patterns. Commonly, in speech recognition, the pattern
capacity is computed as a dot product of stored patterns. Following
John Hopfield and David Tank model, the network’s critical
capacity can be computed by 0.15N, where N is the neuron number
of the given network.

Practical Considerations:
In reality, the operation of the Hopfield Network might be less than
the theoretical and critical limits because what limits these are the
factors such as noise, pattern correlation as well as dynamic of the
network. The actual capacity of a Hopfield network may fluctuate
depending on particular implementation details like network
architecture, learning rule, and the distress that is caused by noise.
DEPARTMENT OF CSE (AI/ML)

Assignment Questions 3– 18.03.2024

Sl. No Question CO’s Bloo

ms
Level
1. Given any dataset of your choice compare with various CNN CO4 L3
architecture like ResNet, AlexNet, VGG16, Stacked CNN, Dilated
CNN, Inception Network and LeNet.

ANSWER 1. ResNet (Residual Networks):

- Accuracy: ResNet's high accuracy on CIFAR-10 dataset comes
primarily due to its deep architecture and residual connection, traits
which train very deep networks efficiently. Accuracy is as high as
90% plus when variable training is applied.
- Model Complexity: ResNet exhibits relatively more complex
structure than the rest of interconnections because of deep model
that uses the skip connections to skip the layers. It is probable that
longer training periods and the need for more memory will result
from this fact.
- Training Time: Training ResNet might encounter the problem of
time-consumption as it is profound and comprises of a multi-layer
of features. On the flip side, machine learning algorithms
leveraging pretraining or transfer learning that filter data can
accelerate training.

2. AlexNet:
- Accuracy: By this we would get a figure of nearly one hundred
percent since in CIFAR-10 it performs just so-so, giving a little less
accuracy than modern architectures such as ResNet, VGG and
others. It is capable of meeting the accuracy requirement with
numbers in the range of 80 – 85%.
- Model Complexity: AlexNet has a moderate model complexity
compared to ResNet. It is usually preferred for applications that are
not resource-intensive, such as small-scale machine learning
models or low-cost smartphones.
- Training Time: As per experience, AlexNet takes less time to
train than the more complex networks like ResNet, however, the
amount of hardware capacity it needs might still be large.

3. VGG16:
- Accuracy: VGG16 is widely recognized for it being less
complicated and yet still efficient. It produces good performance
competing with that of CIFAR-10 which is slightly lower in the
range of 85-89%.
- Model Complexity: VGG16's model complexity is mainly due to
its deep structure and small catch sizes of filters. The parameters of
VGG16 are greater than AlexNet while they are smaller than
ResNet.
- Training Time: Train-time of VGG16 is quite heavy since of its
depth and multiple parameters. Nevertheless, ResNet simplicity as
opposed to the Conv Net can favour faster convergence.

4. Stacked CNN:
- Accuracy: For an architect with a stacked model, multiple
convolutional layers are placed one after the other. The precision
provided by stacked CNNs relies on the structure of the involved
architecture and the depth. With more than enough complexity,
they can match the accuracy of VGG or ResNet if implemented
well.
- Model Complexity: By manipulating two parameters:the number
of layers and filter sizes, we end up with models of varying
complexity. As the architectures grow deeper, they become more
complex and have to sequentially use many parameters.
- Training Time: The depth and complexity of stacked CNNs is
the factor that will determine their training time. More profound
structure might need more long training but attaining an
outstanding result in the possible way.

5. Dilated CNN:
- Accuracy: Big CNNs use dilated convolutions to scale up a
receptive area avoiding parameter increase at the same time. They
may win in CIFAR-10 like VGG or ResNet by the measures of the
most competitive accuracy.
- Model Complexity: Dilated CNNs are a middle-class model
complexity level, whereas other structures as VGG or ResNet are
considered with larger model complexity. They rather have fewer
paramets since the dilation convolutions are in the game.
- Training Time: Investigated time for dilated CNNs in most cases
does not differ from it for VGG or ResNet type of CNNs, because
they are similar in their model complexity.

6. Inception Network (GoogLeNet):

- Accuracy: Inception nets power themselves with many parallel
convolutional strands to beat the image datasets well. The
automated commerce platforms can replicate the precision of VGG
or ResNet architectures.
- Model Complexity: Inception networks tend to be convoluted in
modelling as they make use of multiple instances in parallel and
possess vast number of trains of parameters. They are more power
consuming than their forerunners, which are VGG and AlexNet, as
they need the greater computational resources and other
requirements.
- Training Time: It may take time longer to train inception models
as compared to simple architectures. It can take this due to their
complexities.

7. LeNet:
-Accuracy: LeNet being one of first CNNs introduced and might
not yield works as the accuracies achieved by modern architectures
in CIFAR-10 dataset. It can show an accuracy of up to plus/minus
70%.
-Model Complexity: Unlike VGG and ResNet, LeNet has a
simplistic model and, therefore, can be considered a low level
model complexity model. It is modelized with a lesser number of
parameters which makes it less calculationally intensive.
-Training Time: Humanize: LeNet and the deeper architectures
VGG and ResNet are two platforms being used for training. The
simplicity and lower model complexity of the LeNet is the reason
for faster training and high performance compared to the VGG and
ResNet.

2. Given any dataset of your choice demonstrate the use of CO3 L3

Regularization (all types).
ANSWER
Let's consider the famous Iris dataset, which contains information
about different species of iris flowers, including sepal and petal
dimensions. We'll perform classification using logistic regression
and apply various regularization techniques to mitigate overfitting.

Lasso Regression (L1 Regularization): Lasso regression adds a

penalty term equal to the absolute value of the coefficients to the
loss function. This penalizes large coefficients and encourages
sparsity in the model.

Ridge Regression (L2 Regularization): Ridge regression adds a

penalty term equal to the square of the coefficients to the loss
function. This penalizes large coefficients and tends to shrink them
towards zero.

Elastic Net Regression: Elastic Net regression combines both L1

and L2 regularization by adding penalties for both the absolute
value and the square of the coefficients. This allows for a balance
between feature selection (sparsity) and coefficient shrinkage.
DEPARTMENT OF CSE (AI/ML)

Assignment Questions 4– 18.03.2024

Sl. No Question CO’s Bloo

ms
Level
1. Applications of Deep Recurrent Networks and Recursive Neural CO4 L3
Networks.

Deep recurrent networks (RNNs) and recursive neural networks

(RecNNs) have various applications across different domains due
to their ability to model sequential and hierarchical data effectively.
Here are some applications for each:

Deep Recurrent Networks (RNNs):

1. Natural Language Processing (NLP):

Sentiment Analysis: Analyzing sentiment in text data, such as

movie reviews or social media posts.
Machine Translation: Translating text from one language to
another.
Named Entity Recognition (NER): Identifying named entities like
persons, organizations, and locations in text.
Text Generation: Generating human-like text, such as in chatbots
or content creation.

2. Time Series Prediction:

Stock Price Prediction: Predicting future stock prices by

analyzing historical data.
Weather Forecasting: Predicting future weather conditions based
on past observations.
Demand Forecasting: Estimating future demand for products or
services.

3. Speech Recognition:

Speech-to-Text Conversion: Converting spoken language into

written text, commonly used in virtual assistants like Siri or Google
Assistant.
Speaker Identification: Identifying speakers based on their voice
characteristics.

4. Sequence-to-Sequence Learning:

Chatbots: Generating responses in a conversational setting.

Question Answering: Answering questions based on context, such
as in search engines or virtual assistants.

Recursive Neural Networks (RecNNs):

1. Image Processing:

Image Captioning: Generating natural language descriptions for

images.
Object Recognition: Identifying objects within images and their
relationships.

2. Graph-Structured Data:

Social Network Analysis: Analyzing connections and

communities within social networks.
Molecular Structure Analysis: Predicting properties or activities
of molecules based on their structure.

3. Parsing and Syntax Tree Processing:

Natural Language Parsing: Analyzing the grammatical structure

of sentences.
Semantic Role Labeling: Identifying the roles of words in a
sentence, such as subject, object, etc.

4. Document Understanding:

Document Classification: Categorizing documents into predefined

categories, such as spam detection or topic classification.
Information Extraction: Extracting structured information from
unstructured text documents, such as extracting entities or
relationships.

2. Given any dataset of your choice demonstrate BPTT in RNN. CO3 L3

ANSWER import numpy as np

sequence_length = 10 # Length of the sequence
input_size = 1 # Size of each input element
hidden_size = 5 # Size of the hidden state
output_size = 1 # Size of the output

np.random.seed(0)
data = np.random.randn(sequence_length, input_size)

Wxh = np.random.randn(hidden_size, input_size) # Input-to-

hidden weights
Whh = np.random.randn(hidden_size, hidden_size) # Hidden-to-
hidden weights
Why = np.random.randn(output_size, hidden_size) # Hidden-to-
output weights
bh = np.zeros((hidden_size, 1)) # Hidden bias
by = np.zeros((output_size, 1)) # Output bias

learning_rate = 0.01
epochs = 1000

def forward(inputs, hprev):

xs, hs, ys, ps = {}, {}, {}, {}
hs[-1] = np.copy(hprev)
for t in range(len(inputs)):
xs[t] = inputs[t].reshape(input_size, 1)
hs[t] = np.tanh(np.dot(Wxh, xs[t]) + np.dot(Whh, hs[t-1]) +
bh)
ys[t] = np.dot(Why, hs[t]) + by
ps[t] = np.exp(ys[t]) / np.sum(np.exp(ys[t])) # Softmax
return xs, hs, ps

def backward(xs, hs, ps, targets):

dWxh, dWhh, dWhy = np.zeros_like(Wxh),
np.zeros_like(Whh), np.zeros_like(Why)
dbh, dby = np.zeros_like(bh), np.zeros_like(by)
dhnext = np.zeros_like(hs[0])
for t in reversed(range(len(inputs))):
dy = np.copy(ps[t])
dy[targets[t]] -= 1 # Backprop into softmax
dWhy += np.dot(dy, hs[t].T)
dby += dy
dh = np.dot(Why.T, dy) + dhnext # Backprop into hidden
layer
dhraw = (1 - hs[t] * hs[t]) * dh # Backprop through tanh
nonlinearity
dbh += dhraw
dWxh += np.dot(dhraw, xs[t].T)
dWhh += np.dot(dhraw, hs[t-1].T)
dhnext = np.dot(Whh.T, dhraw)
return dWxh, dWhh, dWhy, dbh, dby

for epoch in range(epochs):

hprev = np.zeros((hidden_size, 1)) # Initial hidden state

inputs = data[:-1] # Input sequence
targets = data[1:] # Target sequence (predict the next element)
xs, hs, ps = forward(inputs, hprev)

loss = -np.sum(np.log(ps[t][targets[t], 0]) for t in

range(len(inputs)))

dWxh, dWhh, dWhy, dbh, dby = backward(xs, hs, ps, targets)

# Update weights and biases

Wxh -= learning_rate * dWxh
Whh -= learning_rate * dWhh
Why -= learning_rate * dWhy
bh -= learning_rate * dbh
by -= learning_rate * dby

if epoch % 100 == 0:
print(f"Epoch {epoch}, Loss: {loss}")

hprev = np.zeros((hidden_size, 1)) # Initial hidden state

test_input = data[0].reshape(input_size, 1) # First element of the
sequence
predicted_sequence = [test_input.flatten()]
for t in range(1, sequence_length):
xs, hs, ps = forward([test_input], hprev)
test_input = ps[0] # Predict the next element
predicted_sequence.append(test_input.flatten())

print("\nPredicted sequence:")
for i, element in enumerate(predicted_sequence):
print(f"Element {i+1}: {element}")

3. Given a dataset Music Genre using LSTM. CO4 L3

Dataset: Music Genre

The dataset contains audio files of music tracks along with their
corresponding genre labels. Each audio file is represented as a
sequence of audio samples, and each music track is associated
with a genre label indicating its genre category (e.g., rock, pop,
jazz, classical).

Approach: Long Short-Term Memory (LSTM)

LSTM (Long Short-Term Memory) is a type of recurrent neural

network (RNN) architecture that is well-suited for sequence
modeling tasks, such as time series prediction, natural language
processing, and music generation. In the context of music genre
classification, LSTM can learn to extract features from sequential
audio data and classify music tracks into different genre
categories.

Implementation Steps:

1. Data Preprocessing:
- Load the audio files and their corresponding genre labels.
- Preprocess the audio data by extracting features such as Mel-
Frequency Cepstral Coefficients (MFCCs), spectrograms, or other
representations suitable for audio data.
- Divide the dataset into training, validation, and test sets.

2. Model Architecture:
- Design an LSTM-based neural network architecture for music
genre classification.
- The input to the LSTM network will be the sequence of audio
features extracted from the audio files.
- Add one or more LSTM layers followed by fully connected
layers with appropriate activation functions.
- The output layer will have units equal to the number of genre
categories, with a softmax activation function to output genre
probabilities.

3. Training:
- Train the LSTM model using the training dataset.
- Define appropriate loss function, such as categorical cross-
entropy, and optimizer, such as Adam or RMSprop.
- Monitor the model's performance on the validation set to
prevent overfitting by using techniques like early stopping or
dropout.

4. Evaluation:
- Evaluate the trained LSTM model on the test dataset to
measure its performance in classifying music genres.
- Calculate metrics such as accuracy, precision, recall, and F1-
score to assess the model's classification performance.

5. Deployment:
- Once the LSTM model achieves satisfactory performance,
deploy it in production environments for music genre
classification tasks.
- Integrate the model into applications or systems where music
genre classification is required, such as music streaming platforms
or recommendation systems.

By implementing LSTM for music genre classification, we can

build a model that learns to recognize patterns in sequential audio
data and accurately classify music tracks into different genre
categories, enabling various applications in the music industry.

Solid Starts - First 100 Days
94% (18)
Solid Starts - First 100 Days
287 pages
Hourglass Workout Program by Luisagiuliet 2
76% (21)
Hourglass Workout Program by Luisagiuliet 2
51 pages
12 Week Program: Summer Body Starts Now
89% (45)
12 Week Program: Summer Body Starts Now
70 pages
The Hold Me Tight Workbook - Dr. Sue Johnson
100% (16)
The Hold Me Tight Workbook - Dr. Sue Johnson
187 pages
Read People Like A Book by Patrick King-Edited
62% (66)
Read People Like A Book by Patrick King-Edited
12 pages
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
77% (13)
Livingood, Blake - Livingood Daily Your 21-Day Guide To Experience Real Health
260 pages
Facial Gains Guide (001 081)
91% (45)
Facial Gains Guide (001 081)
81 pages
Cheat Code To The Universe
94% (77)
Cheat Code To The Universe
34 pages
Curse of Strahd
95% (467)
Curse of Strahd
258 pages
The Psychiatric Interview - Daniel Carlat
91% (34)
The Psychiatric Interview - Daniel Carlat
473 pages
The Borax Conspiracy
91% (57)
The Borax Conspiracy
14 pages
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
94% (212)
COSMIC CONSCIOUSNESS OF HUMANITY - PROBLEMS OF NEW COSMOGONY (V.P.Kaznacheev,. Л. V. Trofimov.)
212 pages
The Secret Language of Attraction
86% (107)
The Secret Language of Attraction
278 pages
How To Develop and Write A Grant Proposal
83% (541)
How To Develop and Write A Grant Proposal
17 pages
Workbook For The Body Keeps The Score
88% (52)
Workbook For The Body Keeps The Score
111 pages
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
83% (1016)
Donald Trump & Jeffrey Epstein Rape Lawsuit and Affidavits
13 pages
KamaSutra Positions
78% (69)
KamaSutra Positions
55 pages
7 Hermetic Principles
93% (28)
7 Hermetic Principles
3 pages
27 Feedback Mechanisms Pogil Key
75% (12)
27 Feedback Mechanisms Pogil Key
6 pages
Frank Hammond - List of Demons
92% (92)
Frank Hammond - List of Demons
3 pages
36 Questions That Lead To Love
91% (35)
36 Questions That Lead To Love
3 pages
36 Questions To Fall in Love 1
97% (31)
36 Questions To Fall in Love 1
2 pages
The 36 Questions That Lead To Love - The New York Times
94% (34)
The 36 Questions That Lead To Love - The New York Times
3 pages
100 Questions To Ask Your Partner
80% (35)
100 Questions To Ask Your Partner
2 pages
The 36 Questions That Lead To Love - The New York Times
95% (21)
The 36 Questions That Lead To Love - The New York Times
3 pages
Jeffrey Epstein39s Little Black Book Unredacted PDF
75% (12)
Jeffrey Epstein39s Little Black Book Unredacted PDF
95 pages
ALCHEMIST
64% (14)
ALCHEMIST
4 pages
1001 Songs
71% (69)
1001 Songs
1,798 pages
Zodiac Sign & Their Most Common Addictions
63% (30)
Zodiac Sign & Their Most Common Addictions
9 pages
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
23% (954)
The 4 Hour Workweek, Expanded and Updated by Timothy Ferriss - Excerpt
38 pages
True of False: H12-111-Enu Hcia-Iot V2.5 Exam
100% (4)
True of False: H12-111-Enu Hcia-Iot V2.5 Exam
20 pages
1651 - Vo Nguyen Duy Nam - GCS200888 - Assignment Brief 2
No ratings yet
1651 - Vo Nguyen Duy Nam - GCS200888 - Assignment Brief 2
20 pages
Coincent - Data Science With Python Assignment
100% (2)
Coincent - Data Science With Python Assignment
23 pages
ML Question Bank-1
No ratings yet
ML Question Bank-1
10 pages
ASSIGNMENT 1ML
No ratings yet
ASSIGNMENT 1ML
5 pages
ML Endsem 2022
No ratings yet
ML Endsem 2022
7 pages
EE378A - Combined Notes
No ratings yet
EE378A - Combined Notes
76 pages
University of Gondar: August 2011 E.C Gondar, Ethiopia
No ratings yet
University of Gondar: August 2011 E.C Gondar, Ethiopia
10 pages
Bachelor Thesis Zu Wenig Seiten
100% (3)
Bachelor Thesis Zu Wenig Seiten
7 pages
Pa ZG512 Ec-3r First Sem 2022-2023
No ratings yet
Pa ZG512 Ec-3r First Sem 2022-2023
5 pages
21CS743 Model Question Paper Solution
No ratings yet
21CS743 Model Question Paper Solution
32 pages
ML Final Notes Unit 4,5 Rishi
No ratings yet
ML Final Notes Unit 4,5 Rishi
45 pages
PYQ_ML
No ratings yet
PYQ_ML
8 pages
2_syllabus
No ratings yet
2_syllabus
3 pages
Mini Project PPT, Sumit Malan
No ratings yet
Mini Project PPT, Sumit Malan
12 pages
MACHINE LEARNING
No ratings yet
MACHINE LEARNING
6 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
21 pages
fda_a3_13642032.pdf
No ratings yet
fda_a3_13642032.pdf
19 pages
Tutorial1_ML_Cyber4
No ratings yet
Tutorial1_ML_Cyber4
3 pages
sourav moocs a2 65
No ratings yet
sourav moocs a2 65
32 pages
Q No. 1 1.1machine Learning:: Machine Learning Is The Study of Computer Algorithms That Improve Automatically
No ratings yet
Q No. 1 1.1machine Learning:: Machine Learning Is The Study of Computer Algorithms That Improve Automatically
10 pages
Aam Ut-1 Qb Ans [Final]
No ratings yet
Aam Ut-1 Qb Ans [Final]
26 pages
Engineering Assignment Coversheet Student Number(s) 905460: Asked Yuhe
No ratings yet
Engineering Assignment Coversheet Student Number(s) 905460: Asked Yuhe
5 pages
Green University of Bangladesh Department of Computer Science and Engineering (CSE)
No ratings yet
Green University of Bangladesh Department of Computer Science and Engineering (CSE)
6 pages
3.1 K - Means
No ratings yet
3.1 K - Means
16 pages
C073 AI Assignment2
No ratings yet
C073 AI Assignment2
4 pages
KNN
No ratings yet
KNN
8 pages
Act8
No ratings yet
Act8
20 pages
Mini_project_1 (3)
No ratings yet
Mini_project_1 (3)
4 pages
Analysis and Design of Algorithm Final
No ratings yet
Analysis and Design of Algorithm Final
10 pages
LP III Lab Manual
100% (1)
LP III Lab Manual
8 pages
AAM UT-1 QB ANS
No ratings yet
AAM UT-1 QB ANS
12 pages
2023 Assignment2 SIT744
No ratings yet
2023 Assignment2 SIT744
6 pages
What Are The Differences Between Supervised and Unsupervised Learning?
No ratings yet
What Are The Differences Between Supervised and Unsupervised Learning?
22 pages
ME1_Syllabus_Papers (2)
No ratings yet
ME1_Syllabus_Papers (2)
40 pages
3710216_merged
No ratings yet
3710216_merged
8 pages
Top 90+ Data Science Interview Questions and Answers (2024)
No ratings yet
Top 90+ Data Science Interview Questions and Answers (2024)
38 pages
Meng - 233 - Fall-2020-2021 - Labs Manual - 10 Nov 2020 PDF
No ratings yet
Meng - 233 - Fall-2020-2021 - Labs Manual - 10 Nov 2020 PDF
21 pages
Solved Paper-2024 (Raihan-13017704423) - MUHAMMED RAIHAN
No ratings yet
Solved Paper-2024 (Raihan-13017704423) - MUHAMMED RAIHAN
14 pages
Exercises INF 5860 Solution Hints
No ratings yet
Exercises INF 5860 Solution Hints
11 pages
Assignment1_LATEX
No ratings yet
Assignment1_LATEX
11 pages
DL Unit 3
No ratings yet
DL Unit 3
59 pages
ML U2 Notes
No ratings yet
ML U2 Notes
12 pages
Lecture 2: Basics and Definitions: Networks As Data Models
No ratings yet
Lecture 2: Basics and Definitions: Networks As Data Models
28 pages
AI Phase2
No ratings yet
AI Phase2
13 pages
Assignment
No ratings yet
Assignment
7 pages
machinelearning
No ratings yet
machinelearning
26 pages
UNIT 3 - Final
No ratings yet
UNIT 3 - Final
37 pages
DWM Exp7 C49
No ratings yet
DWM Exp7 C49
11 pages
Agniva
No ratings yet
Agniva
16 pages
6csdsyll
No ratings yet
6csdsyll
48 pages
6cessyll
No ratings yet
6cessyll
50 pages
Classification
No ratings yet
Classification
8 pages
C. Cifarelli Et Al - Incremental Classification With Generalized Eigenvalues
No ratings yet
C. Cifarelli Et Al - Incremental Classification With Generalized Eigenvalues
25 pages
IT445 Project
No ratings yet
IT445 Project
10 pages
ML Mid 1 Ans
No ratings yet
ML Mid 1 Ans
26 pages
QUESTION BANK ,sample paper , and many more
No ratings yet
QUESTION BANK ,sample paper , and many more
43 pages
Project Report 2
No ratings yet
Project Report 2
11 pages
Relatório Machine Learning
No ratings yet
Relatório Machine Learning
24 pages
Bilal Ahmed Shaik Data Mining
No ratings yet
Bilal Ahmed Shaik Data Mining
88 pages
A Study of Classification Algorithms Using Rapidminer
No ratings yet
A Study of Classification Algorithms Using Rapidminer
12 pages
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
DATA STRUCTURES
No ratings yet
DATA STRUCTURES
24 pages
Computer Science Library Management Project
No ratings yet
Computer Science Library Management Project
36 pages
Spare Parts List: Chain Saws 572 XP/XPG
No ratings yet
Spare Parts List: Chain Saws 572 XP/XPG
42 pages
Esa Pro 2 Manual en
No ratings yet
Esa Pro 2 Manual en
36 pages
Proportional Reducing Valves Rzgo, Hzgo, Kzgo: Pilot Operated, ISO 4401 Size 06, 10
No ratings yet
Proportional Reducing Valves Rzgo, Hzgo, Kzgo: Pilot Operated, ISO 4401 Size 06, 10
4 pages
Autocad Civil 3D For Surveyors: Course Length: 2 Days
No ratings yet
Autocad Civil 3D For Surveyors: Course Length: 2 Days
4 pages
[Ebooks PDF] download Clinical Anatomy and Physiology of the Visual System, 4e (Aug 9, 2021)_(0323711685)_(Elsevier) 4th Edition Remington Od Ms Faao full chapters
100% (4)
[Ebooks PDF] download Clinical Anatomy and Physiology of the Visual System, 4e (Aug 9, 2021)_(0323711685)_(Elsevier) 4th Edition Remington Od Ms Faao full chapters
37 pages
acccount_060946
No ratings yet
acccount_060946
5 pages
DHL Project Proposal
No ratings yet
DHL Project Proposal
28 pages
Fishfinder 580 Chartplotter Operations Manual
No ratings yet
Fishfinder 580 Chartplotter Operations Manual
134 pages
Armeña Baloloy Cea Llave Powerplant Design With Renewable Energy Phase 1 Project
No ratings yet
Armeña Baloloy Cea Llave Powerplant Design With Renewable Energy Phase 1 Project
24 pages
98-99 LS400 Owners Manual Dash Light Meanings
No ratings yet
98-99 LS400 Owners Manual Dash Light Meanings
10 pages
SAP Cloud For Customer Extension Guide: Public Document Version: 2002 - 2020-05-02
No ratings yet
SAP Cloud For Customer Extension Guide: Public Document Version: 2002 - 2020-05-02
160 pages
Database Systems Mcq's 150 Marked
No ratings yet
Database Systems Mcq's 150 Marked
21 pages
Bibliografie Teza de Licenta
No ratings yet
Bibliografie Teza de Licenta
1 page
Last M
No ratings yet
Last M
8 pages
2021-BIM PROPOSAL - ALVEO-CERCA PH 3 B+T1 - Docusigned
100% (1)
2021-BIM PROPOSAL - ALVEO-CERCA PH 3 B+T1 - Docusigned
4 pages
Itil
No ratings yet
Itil
125 pages
Footprinting - Putri Erawaty Bakara - N.Priskila Napitupulu - Yanada Sari BR Situmorang
No ratings yet
Footprinting - Putri Erawaty Bakara - N.Priskila Napitupulu - Yanada Sari BR Situmorang
50 pages
Networking Cheat Sheet - by Codelivly
No ratings yet
Networking Cheat Sheet - by Codelivly
5 pages
SG350-28 Datasheet: Get A Quote
No ratings yet
SG350-28 Datasheet: Get A Quote
3 pages
Programming in Java Assignment 1: NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
No ratings yet
Programming in Java Assignment 1: NPTEL Online Certification Courses Indian Institute of Technology Kharagpur
10 pages
Fshortcut Keys Description
No ratings yet
Fshortcut Keys Description
2 pages
00 AGENDA Radioss Intro V2019 MAR29-2019
No ratings yet
00 AGENDA Radioss Intro V2019 MAR29-2019
22 pages
Gas Turbine
100% (2)
Gas Turbine
25 pages
Cool-N-Comfort-Data-Sheet-V2 Print
No ratings yet
Cool-N-Comfort-Data-Sheet-V2 Print
4 pages
The Basics of Python For Loops A Tutorial - Learn Data Science With Dataquest
No ratings yet
The Basics of Python For Loops A Tutorial - Learn Data Science With Dataquest
1 page
RICE - Information Sheet and Data Privacy Consent Form
No ratings yet
RICE - Information Sheet and Data Privacy Consent Form
1 page