DSE 2151 24 Sep 2022
Type: MCQ
Q1. A variable called ‘Grade in exam’ with values coded as: A+, A, B, C, ... is a ___________
variable (0.5)
1. Ordinal
2. ratio
3. continuous
4. Dichotomous
1. Histogram
2. Box Plot
3. Scatter Plot
4. Bar chart
Q4. For the following set of values: 3, 4, 7, 2, 3, 7, 4, 2, 4, 7, 4, the Q3 value is ______ (0.5)
1. 4
2. 7
3. 3
4. 26
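A quick check of the quartiles for the values in Q4, as a minimal sketch. It assumes the textbook convention in which Q3 is the median of the upper half of the sorted data, which corresponds to the "exclusive" method of Python's `statistics.quantiles`. Other conventions (for example the "inclusive" interpolation method) can give a different Q3 for the same data, so the result depends on the definition used in the course.

```python
from statistics import quantiles

# Values from Q4
values = [3, 4, 7, 2, 3, 7, 4, 2, 4, 7, 4]

# "exclusive" method: matches the median-of-upper-half convention for this data
q1, q2, q3 = quantiles(values, n=4, method="exclusive")

# "inclusive" method: linear interpolation over the sample, shown for comparison
q3_inclusive = quantiles(values, n=4, method="inclusive")[2]

print("sorted:", sorted(values))
print("Q1, median, Q3 (exclusive):", q1, q2, q3)
print("Q3 (inclusive):", q3_inclusive)
```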
Q5. _______ is a measure that quantifies the lack of symmetry in a data distribution. (0.5)
1. Kurtosis
2. Mean
3. Skewness
4. Median
Q6. A member of the Data Analytics project who has specific knowledge of the subject or business
problem is called a ___________ (0.5)
1. Consumer
2. IT Expert
3. Subject Matter Expert
4. Supplier
Q7. _______ is the process where an estimate is calculated for some variable that is unknown.
(0.5)
1. Prediction
2. Summarization
3. Exploration
4. Association
Q8. If the frequency distribution is approximately normal, approximately 95% of all observations fall
within __________ standard deviations of the mean. (0.5)
1. One
2. Two
3. Three
4. Four
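The rule referred to in Q8 can be checked numerically. The sketch below assumes an exactly normal distribution and uses the standard library's `NormalDist`; the helper name `coverage_within` is introduced here for illustration only.

```python
from statistics import NormalDist

def coverage_within(k: float) -> float:
    """Probability that a normal variable lies within k standard deviations of its mean."""
    std_normal = NormalDist(mu=0.0, sigma=1.0)
    return std_normal.cdf(k) - std_normal.cdf(-k)

# Coverage for 1, 2 and 3 standard deviations
for k in (1, 2, 3):
    print(k, round(coverage_within(k), 4))
```

For k = 2 the coverage is about 0.9545, which is where the "approximately 95%" figure comes from.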
Q9. What is the objective of a hypothesis test? (0.5)
1. To make some assumptions about the population.
2. To determine if change in one variable directly causes a change in another variable.
3. To generalize our sample data to suitable situations or population.
4. To determine if change in one variable indirectly causes a change in another variable.
Q10. A variable called ‘Color of Car’ with values coded as: 5-Black, 4-Brown, 3-Grey is a __________
variable (0.5)
1. Continuous
2. Ordinal
3. Discrete
4. Dichotomous
Type: DES
Q11. How does a data analyst identify noisy data? What strategy can be adopted to treat the
following (mention at least one strategy for each subdivision):
i. noisy numeric data
To treat noisy numeric data, Strategies include
[0.5 marks]
Defining Null Hypothesis and Alternative hypothesis [0.5 marks]
. (2)
ii. Create a contingency table to tabulate the Country of Origin and Number of Cylinders.
0.5 mark
Country of    Number of Cylinders      Total
Origin        4       6       8
1             1       2       3        6
2             2       0       0        2
3             2       0       0        2
Totals        5       2       3        10
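The tabulation can be sketched in a few lines. The exam's original ten-row data set is not reproduced in this document, so the `records` list below is an assumption: it is rebuilt from the cell counts in the table above, purely to show how such a contingency table is computed.

```python
from collections import Counter

# (country_of_origin, cylinders) pairs rebuilt from the cell counts in the table;
# this is NOT the original data set, only a reconstruction consistent with it.
records = (
    [(1, 4)] * 1 + [(1, 6)] * 2 + [(1, 8)] * 3
    + [(2, 4)] * 2
    + [(3, 4)] * 2
)

counts = Counter(records)
origins = sorted({origin for origin, _ in records})
cylinders = sorted({cyl for _, cyl in records})

# One row per country of origin, one column per cylinder count
table = {o: [counts[(o, c)] for c in cylinders] for o in origins}
row_totals = {o: sum(row) for o, row in table.items()}
col_totals = [sum(table[o][i] for o in origins) for i in range(len(cylinders))]

print("Origin", *cylinders, "Total")
for o in origins:
    print(o, *table[o], row_totals[o])
print("Totals", *col_totals, sum(col_totals))
```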
iii. Find the correlation between Horse power and Weight and comment on the relationship
between the variables.
iv. Visualize the relationship between Horse power and Weight using a scatter plot.
0.5 mark for calculation of the means (94.3, 2576) and standard deviations (38.17, 77.84)
1 mark for correct computation of the numerator and the correlation coefficient (0.939)
0.5 mark for inference (strong positive correlation) and scatter plot
. (3)
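The steps rewarded in the scheme above (means, deviations, numerator, coefficient) can be sketched as a small Pearson correlation function. The horsepower and weight lists below are hypothetical illustrative values, not the exam's data set, which is not reproduced in this document; the function name `pearson_r` is also introduced here for illustration.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Numerator: sum of products of deviations from the means
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Denominator: square root of the product of the sums of squared deviations
    # (zero if either sequence is constant, in which case r is undefined)
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return numerator / sqrt(ss_x * ss_y)

# Hypothetical values for illustration only (NOT the exam data)
horsepower = [60, 80, 100, 120, 150]
weight = [1800, 2200, 2500, 2900, 3500]

r = pearson_r(horsepower, weight)
print("r =", round(r, 3))
```

A value of r close to +1 indicates a strong positive linear relationship, which is the inference the marking scheme expects for the exam data.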
Q14. Consider a data set with the values 250, 370, 420, 605, 1100. Perform data transformation on
each of the above values with the:
i. Mean Normalization method
-0.3518, -0.2106, -0.1518, 0.0659, 0.6482
ii. Min-max normalization method by setting min = 1 and max = 10
1, 2.27, 2.8, 4.759, 10
iii. decimal scaling method
0.025, 0.037, 0.042, 0.0605, 0.11
1 mark each. (3)
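The three transformations in Q14 can be sketched as follows. This assumes the usual definitions: mean normalization as (x - mean) / (max - min), min-max normalization as a linear rescale onto [new_min, new_max], and decimal scaling as division by the smallest power of ten that brings every absolute value below 1. The function names are introduced here for illustration.

```python
values = [250, 370, 420, 605, 1100]

def mean_normalize(xs):
    """(x - mean) / (max - min) for each x."""
    mean = sum(xs) / len(xs)
    spread = max(xs) - min(xs)
    return [(x - mean) / spread for x in xs]

def min_max_scale(xs, new_min=1.0, new_max=10.0):
    """Linearly rescale xs so that min(xs) -> new_min and max(xs) -> new_max."""
    lo, hi = min(xs), max(xs)
    return [new_min + (x - lo) * (new_max - new_min) / (hi - lo) for x in xs]

def decimal_scale(xs):
    """Divide by 10**j, with j the smallest integer such that max(|x|) / 10**j < 1."""
    j = 0
    while max(abs(x) for x in xs) / 10 ** j >= 1:
        j += 1
    return [x / 10 ** j for x in xs]

print([round(v, 4) for v in mean_normalize(values)])
print([round(v, 4) for v in min_max_scale(values)])
print(decimal_scale(values))
```

For this data set the mean is 549 and the range is 850, and decimal scaling divides by 10^4 because the largest value, 1100, has four digits.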