IS328 Data Mining-Tutorial 1 Solution
IS328 Data Mining-Tutorial 1 Solution
IS328 Data Mining-Tutorial 1 Solution
Semester 2, 2021
1|Page
Q7. ________analysis divides data into groups that are meaningful, useful, or both.
A. Cluster
B. Association
C. Classification
D. Regression
2|Page
Q14. What is the median price?
A $225
B $325
C $350
D $400
Q17. Monthly rainfall in Suva during the last ten years is an example of a:
A Discrete variable
B Continuous variable
C Qualitative variable
D Random variable
A Discrete variable
B Continuous variable
C Qualitative variable
D Random variable
3|Page
b) 19, 13, 15, 25, and 78
Rearrange: 13, 15, 18, 19, 78
Mean
(a) (19 + 13 + 15 + 25 +18)/5 = 18
(b) (19 + 13 + 15 + 25 +78)/5 = 30
SD
(a) Sqrt(1/5*[(19-18)2 + (13-18)2 + (15-18)2 + (25-18)2 + (18-18)2]) = 4.1
(b) Sqrt(1/5*[(19-30)2 + (13-30)2 + (15-302 + (25-30)2 + (78-30)2]) = 24.35
Are the results sensitive to outliers?
Yes, outliers can affect the mean and SD. However, it does not affect the median.
(a) 15, 21, 21, 21, 23, 25, 25, 26, 28 (odd number of scores)
Median = 0.5 * 9 = 4.5 = 5th score = 23
Mode = 21
(b) 9, 12, 12, 15, 15, 18, 26, 27 (even number of scores)
Median = 0.5 * 8 = 4 = average of 4th and 5th score = (15+15)/2 = 15
Mode = 12 and 15 (bimodal
(c) 12, 15, 17, 18, 19, 22, 26, 27 (even number of scores)
Median = 0.5 * 8 = 4 = average of 4th and 5th score = (18+19)/2 = 18.5
Mode = no mode.
Q3. Suppose that a sample of health data for analysis includes the attribute age.
13, 52, 46, 16, 45, 20, 20, 21, 40, 22, 35, 25, 35, 25, 70, 33, 33, 25, 35, 25, 35, 36, 22, 19, 16,
15, 30
Rearranged:
13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52,
70.
4|Page
(c) What is the mode of the data? Comment on the data's modality (i.e., bimodal,
trimodal, etc.).
This data set has two values that occur with the same highest frequency and is, therefore,
bimodal.
The modes (values occurring with the greatest frequency) of the data are 25 and 35.
(e) Find the first quartile (Q1) and the third quartile (Q3) of the data?
Rearranged:
13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45,
46, 52, 70.
5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80
5|Page