14 Statistics and Probability
14 Statistics and Probability
14 Statistics and Probability
Date: _________________________
1. Measures of Spread
Solution:
# &
Since there are 6 numbers, the median lies between the % th position = % th position
# &
and the & % + 1)th position = &% + 1) = 4th position.
1 3 4 7 7 9
'()
Median = %
= 5.5
www.ApexEducators.com 1
2. Variance and Standard Deviation
• Variance measures the average degree which each point differs from the mean.
• Standard measures the spread of the data. It is the square root of the variance.
- A smaller value of standard deviation implies a more consistent (less spread) set of
data.
- The formula to compute standard deviation:
∑ $("+"̅ )! ∑("+"̅ )!
- ∑$
or - #
where
o 𝑓 = frequency or 𝑛 = number of data points
o 𝑥 = value of data point
o 𝑥̅ = mean of data set
Example:
20 students each from Class 4A and Class 4B took part in a Mathematics competition.
Every student has to go through a 10-mark quiz and the results are as follows:
Marks 3 4 5 6 7 8 9 10
Number of students 1 0 4 5 4 2 3 1
2 5 7 8 2 8 9 6 9 5
9 10 8 7 6 7 5 6 10 3
www.ApexEducators.com 2
Solutions:
∑ $"
(a) Mean for Class 4A = ∑$
(.×0)(('×1)((2×')((&×2)(()×')((3×%)((4×.)((01×0)
= 0(1('(2('(%(.(0
0.'
= %1
= 6.7
∑"
Mean for Class 4B = #
%(2()(3(%(3(4(&(4(2(4(01(3()(&()(2(&(01(.
= %1
0.%
= %1
= 6.6
∑ $("+"̅ )!
Standard Deviation for Class 4A = - ∑$
23.'
= - %1
∑("+"̅ )!
Standard Deviation for Class 4B = - #
001.3
=- %1
(b) Class 4A has more coherent results as the standard deviation is lower.
www.ApexEducators.com 3
3. Cumulative Frequency Graph
• A typical cumulative frequency graph looks like an ‘S-shaped’ curve as shown below:
www.ApexEducators.com 4
4. Box-and-whisker Diagram
• The following shows a box-and-whisker diagram and the information that can be
derived from the plot:
- M1 = lower quartile
- M2 = Median
- M3 = Upper quartile
- M4 = Lower limit
- M5 = Upper limit
- M3 – M1 = Interquartile range
• Mutually exclusive events are events that cannot occur concurrently. The first event
cannot occur when the second event occurs.
- The probabilities of the first event and the second event are added together.
P(𝐴 or 𝐵) = P(𝐴) + P(𝐵)
www.ApexEducators.com 5
6. Possibility Diagram
• Possibility diagrams, which usually comes in the form of a table, helps to list down all
possible outcomes in an experiment that will be useful in computing probabilities.
Example:
The six sides of a fair yellow dice are labelled with numbers: 11, 12, 15, 16, 19 and 22.
The six sides of a fair blue dice are labelled with a different set of numbers: 9, 14, 16, 18,
19 and 21.
Yellow dice
11 12 15 16 19 22
9 A
14
Blue Dice
16
18 B
19 X
21
www.ApexEducators.com 6
Solutions:
(a)
Yellow dice
11 12 15 16 19 22
9 A A A A A A
14 B B A A A A
Blue Dice
16 B B B X A A
18 B B B B A A
19 B B B B X A
21 B B B B B A
#6789: <$ =
(b) (i) P(𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑦𝑒𝑙𝑙𝑜𝑤 𝑑𝑖𝑐𝑒 > 𝑣𝑎𝑙𝑢𝑒 𝑜𝑓 𝑏𝑙𝑢𝑒 𝑑𝑖𝑐𝑒) = =>> ?<@@A8A>ABA9@
0&
= .&
'
=4
#6789: <$ C
(ii) P(𝑠𝑎𝑚𝑒 𝑣𝑎𝑙𝑢𝑒) = =>> ?<@@A8A>ABA9@
%
= .&
0
= 03
7. Tree Diagram
• A tree diagram is a pictorial method to list down all the different outcomes in an
experiment.
• The tree diagram is more appropriate than the possibility diagram for solving problems
with multi-step scenarios such as experiment on cards with two or more draws.
Example:
There are 12 cans of soft drink A, 8 cans of soft drink B and 15 cans of soft drink C in an
ice box. Ivan picked two cans from the ice box ay random without replacement.
(a) Construct a tree diagram.
(b) Find the probability that the 2 cans of drinks are
(i) soft drink A,
(ii) 1 soft drink A and 1 soft drink B,
(iii) at least 1 soft drink C.
www.ApexEducators.com 7
Solutions:
(a)
Note that the sum of probabilities in any one branch always add up to ‘1’.
0% 3 02
- E.g. the first main branch = .2 + .2 + .2 = 1
www.ApexEducators.com 8
0% 00
(b) (i) P(𝐴 and 𝐴) = .2 × .'
&&
= 242
OR
www.ApexEducators.com 9
Name : _________________________
Date: _________________________
1. You have found the following ages (in years) of 4 bears. Those bears were randomly
selected from the 22 bears at the zoo.
5, 4, 6, 39
(b) ________________
www.ApexEducators.com 10
2. You have found the following ages (in years) of all 6 lions at the Zoo:
13, 2, 1, 5, 2, 7
www.ApexEducators.com 11
3. You have found the following ages (in years) of 5 zebras at the Zoo:
8, 11, 17, 7, 19
www.ApexEducators.com 12
4. 43 kids aged from 5 to 13 attended a medical check-up in a children’s clinic. The
breakdown of each age group is tabulated as shown:
Age (years) 5 6 7 8 9 10 11 12 13
Number of kids 3 5 7 4 4 2 6 7 5
Calculate
(a) the mean.
(b) the standard deviation.
www.ApexEducators.com 13
5. The ordered stem-and-leaf diagram shows the height (in cm) of 30 beauty contestants.
Find
(a) the probability of selecting two beauty contestants who are above 189 cm tall.
www.ApexEducators.com 14
(e) the standard deviation,
6. 30 students took a memory test and their scores are recorded as follows:
Marks, x 4 5 6 7 8 9 10
No. of students 3 7 8 4 5 2 1
Calculate
(a) the mode,
www.ApexEducators.com 15
(c) the median,
(e) the probability of selecting one student scoring below 5 marks and the other scoring 9
marks and above.
Answer: (e) P(one student < 5 marks and one score ≥ 9) = ________________
www.ApexEducators.com 16
7. The time taken for 250 students to complete an artwork is shown in the table.
Calculate
(a) the mean time,
(b) the standard deviation.
www.ApexEducators.com 17
8. 1000 students took a Mathematics examination which consisted of Paper 1 and Paper 2.
Each paper has a total of 50 marks. The results of these students are tabulated as shown.
Number of candidates
Marks(x)
Paper 1 Paper 2
0 < x ≤ 10 90 100
10 < x ≤ 20 310 270
20 < x ≤ 30 370 295
30 < x ≤ 40 200 295
40 < x ≤ 50 30 40
www.ApexEducators.com 18
Paper 2
_____________________________________________________________________
_____________________________________________________________________
(c) Which paper has more coherent results? Explain your answer.
_____________________________________________________________________
_____________________________________________________________________
www.ApexEducators.com 19
9. During the monsoon season (Nov to Jan), the duration of rain was recorded for Year 2019
and Year 2020.
(a) Determine the mean and standard deviation of Year 2019 and Year 2020.
Year 2019
www.ApexEducators.com 20
Year 2020
(b) Which year has a longer duration of rain? Explain your answer.
_____________________________________________________________________
_____________________________________________________________________
(c) Which year is more consistent in terms of the duration of the rain? Explain your answer.
_____________________________________________________________________
_____________________________________________________________________
www.ApexEducators.com 21
10. The stem-and-leaf diagram shows the masses, in grams, of some oranges.
(a) Given that the interquartile range is 10, find the value of m.
www.ApexEducators.com 22
Name : _________________________
Date: _________________________
1. The cumulative frequency graph shows the amount 250 shoppers spent in a doughnut shop.
www.ApexEducators.com 23
2. The cumulative frequency graph shows the mass of 1000 Secondary One students in sports
school.
www.ApexEducators.com 24
3. Two thousand candidates sat for a Mathematics examination. The cumulative frequency
graph below shows the results. The grading system is as follows:
Grades F E D C6 C5 B4 B3 A2 A1
Marks ≤ 39 40 to 44 45 to 49 50 to 55 56 to 59 60 to 64 65 to 69 70 to 74 > 74
(e) If the passing mark is 45, find the number of students who passed the examination
: ______________________
www.ApexEducators.com 25
4. The diagram is a box-and-whisker plot showing the mass of 500 seashells.
5. The diagram is a box-and-whisker plot showing the height of 250 pupils in Primary One.
www.ApexEducators.com 26
6. 1000 runners took part in a marathon and the time they took to complete the run were shown
in the cumulative frequency graph.
(c) Given that 45% of the runners completed the marathon within t hrs, find the value of t.
t = ______________________
www.ApexEducators.com 27
7. Two thousand students reported the number of hours they spent reading books on
weekends. The data is plotted on a cumulative frequency graph as shown below:
(c) the percentage of students who read more than 5 hrs : ______________________
www.ApexEducators.com 28
8. The cumulative frequency below illustrates the weights of 100 students in Senoko High
School.
(b) The weight of 100 students in Changi High School have a higher median but a smaller
interquartile range.
Describe how the cumulative curve for Changi High School will differ from the curve
for Senoko High School.
_____________________________________________________________________
_____________________________________________________________________
www.ApexEducators.com 29
9. The cumulative frequency graph shows the distribution of marks of 150 students in a
Mathematics examination.
www.ApexEducators.com 30
(a) Use the graph to estimate
(i) the number of students who score more than 36 marks: ___________________
(b) (i) Complete the grouped frequency table of the marks of the 150 students:
www.ApexEducators.com 31
(c) The same group of students took a Science Examination.
The box-and-whisker plot shows the distribution of their marks.
_____________________________________________________________________
_____________________________________________________________________
(ii) Compare and comment on the consistency of the performance of the students in
the two examinations.
_____________________________________________________________________
_____________________________________________________________________
10. (a) The following box-and-whisker diagrams shows the distribution of the mass of 300
students from each school, SK Secondary School and HG Secondary School respectively.
www.ApexEducators.com 32
(ii) Compare the mass of the students from SK Secondary School and HG
Secondary School in two ways.
_______________________________________________________________
_______________________________________________________________
(iii) Mary commented that there are more students in HG Secondary School than SK
Secondary School who weigh more than or equal to 70 kg. Do you agree with
Mary? Support with a reason.
_______________________________________________________________
_______________________________________________________________
(b) The cumulative frequency curve shows the height distribution of 80 plants.
(i) Use your graph to find the value of m, if 32.5% of the plants have heights more
than m cm.
www.ApexEducators.com 33
(ii) The height distribution of the 80 plants was also recorded in the following
frequency table.
Height (h cm) Number of plants
60 < h ≤ 70 2
70 < h ≤ 80 a
80 < h ≤ 90 9
90 < h ≤ 100 27
100 < h ≤ 110 23
110 < h ≤ 120 b
120 < h ≤ 130 4
b = _______________
(b) Hence, find the mean and standard deviation of the height of the 80 plants.
www.ApexEducators.com 34
Name : _________________________
Date: _________________________
1. Bag A contains 4 numbered cards labelled 1, 2, 3 and 4. Bag B contains 5 Bingo balls
numbered 5, 6, 7, 8 and 9. Beatrice picked one item from each bag. She recorded the
numbers on the card and ball and added the values of each item.
(a) Construct a possibility diagram to show all possible sums.
(b) Using the possibility diagram, find the probability that the sum of the two items is
(i) odd,
www.ApexEducators.com 35
(iii) less than 8,
2. There are seven numbered cards labelled 10, 11, 13, 15, 17, 18 and 21 in Box P. There are
three numbered cards labelled 4, 5 and 7 in another Box Q. One card is drawn randomly
from each of the box and the values on the cards are added together.
(a) Construct a possibility diagram to show all possible sums.
www.ApexEducators.com 36
(b) Using the possibility diagram, find the probability that the sum of the two items is
(i) odd,
www.ApexEducators.com 37
3. Joel threw two fair dice and recorded the numbers on the dice. He then multiplied the two
numbers.
(a) Construct a possibility diagram to show all possible products.
(b) Using the possibility diagram, find the probability that the sum of the two items is
(i) odd,
www.ApexEducators.com 38
(iv) divisible by 4,
4. A bag contains 4 red marbles, 5 green marbles and 6 blue marbles. Two marbles are drawn
successively without replacement.
(a) Construct a tree diagram.
www.ApexEducators.com 39
(b) Using the tree diagram you have drawn in (a), determine the probability that
(i) the two marbles are red,
www.ApexEducators.com 40
(vi) the two marbles are of different colours.
5. Three types of coloured pens (10 reds, 4 black pens and 11 blue pens) were kept in a bag.
Helen picked two pens at random without replacement.
(a) Construct a tree diagram.
www.ApexEducators.com 41
(b) Using the tree diagram you have drawn in (a), determine the probability that
(i) the two pens are of the same colour,
www.ApexEducators.com 42
(iv) no red,
www.ApexEducators.com 43
6. In a boc of chocolate cookies jar, there were 10 dark chocolates cookies, 12 milk cookies
and 8 white chocolate cookies. Ivan picks two cookies at random.
(a) Construct a tree diagram.
(b) Using the tree diagram you have drawn in (a), determine the probability that
(i) both cookies are of different flavour,
www.ApexEducators.com 44
(ii) there is no milk chocolate cookies,
(iii) there is one milk chocolate cookie and one white cholcate cookie,
7. 50 students were asked how much pocket monet they received and their answers were
tabulated as follows:
Two students are chosen at random one after another. Find the probability that the sum of
their pocket money is
(a) $39 or more,
www.ApexEducators.com 45
(b) $31 or less,
8. 25 students took a memory test and their marks are recorded as follows:
Marks 4 5 6 7 8 9 10
Number of students 3 5 7 3 4 2 1
Two students are selected at random one after another. Find the probability that the sum of
their marks is
(a) exactly 16 marks,
www.ApexEducators.com 46
(b) 19 marks or above,
9. The ordered stem-and-leaf diagram shows the height (in cm) of 30 beauty contestents.
Two contestents are chosen at random without replacement. Find the proabability that
(a) the heights of both contestents are more than 188 cm,
www.ApexEducators.com 47
(b) the heights of both contestents are less than or equal to 175 cm,
(c) the height of one contestent is above 180 cm and the other is below 170 cm,
Answer: (c) P(one is > 180 cm and one is < 170 cm) = _______________
(d) the heights of both contestents are between 175 cm and 180 cm inclusive.
www.ApexEducators.com 48
10. A box contains a number of blue, green and red balls. There are 27 blue and green balls
altogether. A ball is selected at random from the box. The probability of drawing a
green ball is twice the probability of drawing a blue ball. The probability of drawing a red
'
ball is ) .
%
(a) Show that the probability of drawing a green ball is ) .
(c) Two balls are selected at random from the box, with replacement.
Find the probability that both balls are blue.
(d) Two balls are selected at random from the same box, without replacement.
Find the probability that both balls are different colours.
www.ApexEducators.com 49
11. A drawer contains 2 blue socks and 6 white socks. Two socks are taken from the drawer at
random without replacement. If the two socks are different colours, then a third sock is
taken from the drawer. Otherwise, no third sock is taken.
(a) Draw a tree diagram to show the probabilities of the possible outcomes.
(ii) a third sock is taken and it is the same colour as the first sock.
www.ApexEducators.com 50
12. A box contains five slips of paper. Each slip has one of the numbers 4, 6, 7, 8 or 9 written
on it. There are two players for the game. The first player reaches into the box and draws
two slips and adds the two numbers. If the sum is even, the player wins. If the sum is odd,
the player loses. What is the probability that the first player wins.
13. A game is such that a fair die is rolled respectively until a ‘6’ is obtained.
(a) Find the probability that the game ends by the fourth roll.
(b) Suppose now that the game is such that the same die is rolled repeatedly until two ‘6’s
are obtained. Find the probability that
(i) the game ends on the third roll,
(ii) the game ends on the third roll and the sum of the scores is odd.
www.ApexEducators.com 51