Assignment Mtech
Assignment Mtech
5. List down the names of some popular Activation Functions used in Neural Networks.
13. What is the difference between Forward propagation and Backward Propagation in Neural
Networks?
16. How does the learning rate affect the training of the Neural Network?
21. Why are Deep Neural Networks preferred over Shallow Neural Networks?
22. Overfitting is one of the most common problems every Machine Learning practitioner faces.
Explain some methods to avoid overfitting in Neural Networks.
23. What is the difference between Epoch, Batch, and Iteration in Neural Networks?
24. Suppose we have a perceptron having weights corresponding to the three inputs have the
following values:
w1 = 2 ; w2 = −4; and w3 = 1
Calculate the output value y of the given perceptron for each of the following input patterns:
Pattern P1 P2 P3 P4
X1 1 0 1 1
X2 0 1 0 1
X3 0 1 1 1
25. Consider a feed-forward Neural Network having 2 inputs(label -1 and label -2 )with fully
connected layers and we have 2 hidden layers:
A weight on the connection between nodes i and j is represented by wij, such as w24 is the
weight on the connection between nodes 2 and 4. The following lists contain all the weights
values used in the given network:
w13=−2, w35=1, w23 = 3, w45 = −1, w14 = 4, w36 = −1, w24=−1, w46=1
where v denotes the weighted sum of a node. Each of the input nodes (1 and 2) can only receive
binary values (either 0 or 1). Calculate the output of the network (y5 and y6) for the input pattern
given by (node-1 and node-2 as 0, 0 respectively).
Part 2:
5. Explain false negative, false positive, true negative and true positive with a simple example.
11. Is it better to have too many false positives or too many false negatives? Explain.
13. What is the difference between Gini Impurity and Entropy in a Decision Tree?
15. What is Overfitting? And how do you ensure you’re not overfitting with a model?
19. Running a binary classification tree algorithm is quite easy. But do you know how the tree
decides on which variable to split at the root node and its succeeding child nodes?
20. You are given a cancer detection data set. Let’s suppose when you build a classification
model you achieved an accuracy of 96%. Why shouldn’t you be happy with your model
performance? What can you do about it?
21. You are working on a time series data set. Your manager has asked you to build a high
accuracy model. You start with the decision tree algorithm since you know it works fairly well
on all kinds of data. Later, you tried a time series regression model and got higher accuracy than
the decision tree model. Can this happen? Why?
22. Suppose you found that your model is suffering from low bias and high variance. Which
algorithm you think could tackle this situation and Why?
23. You are given a data set. The data set contains many variables, some of which are highly
correlated and you know about it. Your manager has asked you to run PCA. Would you remove
correlated variables first? Why?
24. You are asked to build a multiple regression model but your model R² isn’t as good as you
wanted. For improvement, you remove the intercept term now your model R² becomes 0.8 from
0.3. Is it possible? How?
25. You’re asked to build a random forest model with 10000 trees. During its training, you got
training error as 0.00. But, on testing the validation error was 34.23. What is going on? Haven’t
you trained your model perfectly?
26. ‘People who bought this also bought…’ recommendations seen on Amazon is based on
which algorithm?
27. A jar has 1000 coins, of which 999 are fair and 1 is double headed. Pick a coin at random,
and toss it 10 times. Given that you see 10 heads, what is the probability that the next toss of that
coin is also a head?
28. How would you predict who will renew their subscription next month? What data would you
need to solve this? What analysis would you do? Would you build predictive models? If so,
which algorithms?
29. There’s a game where you are asked to roll two fair six-sided dice. If the sum of the values
on the dice equals seven, then you win $21. However, you must pay $5 to play each time you roll
both dice. Do you play this game? And in the follow-up: If he plays 6 times what is the
probability of making money from this game?
For each option, what is the expected number of ads shown in 100 news stories?
If we go with option 2, what is the chance a user will be shown only a single ad in 100 stories?
What about no ads at all?