SS Lab Questions
2. Three thousand six hundred people who work in Bradford were asked about the
means of transport that they used for daily commuting. The data collected is
shown in Table 1.19.
Mr P 2045 votes
Mr Q 4238 votes
Mrs R 8605 votes
Ms S 12012 votes
68 64 75 82 68 60 62 88 76 93 73 79 88 73 60 93
71 59 85 75 61 65 75 87 74 62 95 78 63 72 66 78
82 75 94 77 69 74 68 60 96 78 89 61 75 95 60 79
83 71 79 62 67 97 78 85 76 65 71 75 65 80 73 57
88 78 62 76 53 74 86 67 73 81 72 63 76 75 85 77
16.91 9.65 22.68 12.45 18.24 11.79 6.48 12.93 7.25 13.02
8.10 3.25 9.00 9.90 12.87 17.50 10.05 27.43 16.01 6.63
14.73 8.59 6.50 20.35 8.84 13.45 18.75 24.10 13.57 9.18
9.50 7.14 10.41 12.80 32.09 6.74 11.38 17.95 7.25 4.32
8.31 6.50 13.80 9.87 6.29 14.59 19.25 5.74 4.95 15.90
6. Obtain a scatter plot for the data in Table 1.36 and comment on whether there
is a link between road deaths and the number of vehicles on the road. Would
you expect this to be true? Provide reasons for your answer.
Country          Vehicles per 100   Road deaths per
                 population         100,000 population
Great Britain    31                 14
Belgium          32                 30
Denmark          30                 23
France           46                 32
Germany          30                 26
Irish Republic   19                 20
Italy            35                 21
Netherlands      40                 23
Canada           46                 30
U.S.A.           57                 35
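As a numeric complement to the scatter plot, the strength of the linear link can be summarised with the product moment correlation coefficient. The sketch below is only one way to do this: it uses the standard library, the figures are transcribed from the table above, and the names `vehicles`, `deaths` and `pearson_r` are illustrative.

```python
import math

# Figures transcribed from the table above, in the order the countries are listed.
vehicles = [31, 32, 30, 46, 30, 19, 35, 40, 46, 57]   # vehicles per 100 population
deaths = [14, 30, 23, 32, 26, 20, 21, 23, 30, 35]     # road deaths per 100,000 population

def pearson_r(x, y):
    """Product moment correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

r = pearson_r(vehicles, deaths)
print(f"r = {r:.3f}")
```

A value of r near +1 or -1 indicates a strong linear association and a value near 0 a weak one. It says nothing on its own about cause and effect, which is the point of the "Would you expect this to be true?" part of the question.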
7. Obtain a scatter plot for the data in Table 1.37 that represents the passenger miles
flown by a UK based airline (millions of passenger miles) during the period 2003-
2004. Comment on the relationship between miles flown and quarter.
9. The following are the IQs of 12 people: 115, 89, 94, 107, 98, 87, 99, 120, 100, 94,
100, 99. It is claimed that 'the average person in the group has an IQ of over
100'. Is this a reasonable assertion?
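A quick way to examine the claim is to compare the mean with the median and to count how many people actually score above 100. This is a minimal sketch using Python's standard `statistics` module; the variable names are illustrative.

```python
import statistics

iqs = [115, 89, 94, 107, 98, 87, 99, 120, 100, 94, 100, 99]

mean_iq = statistics.mean(iqs)       # arithmetic mean
median_iq = statistics.median(iqs)   # middle value of the sorted data
share_over_100 = sum(1 for v in iqs if v > 100) / len(iqs)

print(f"mean = {mean_iq:.2f}, median = {median_iq}, "
      f"share above 100 = {share_over_100:.2f}")
```

Comparing the three figures shows how far the answer depends on which "average" is meant.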
10. A sample of six components was tested to destruction, to establish how long they
would last. The times to failure (in hours) during testing were 40, 44, 55, 55, 64,
and 69. Which would be the most appropriate average to describe the life of these
components? What are the consequences of your choice?
11. a. Find the mean, median and mode of the following set of data: 1, 1, 1, 1, 1,
2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 4, 4, 4, 4, 5, 5, 5, 5, 5.
b. The average salary paid to graduates in three companies is: £7000, £6000,
and £9000 per annum respectively. If the respective number of graduates in these
companies is 5, 12, and 3, find the mean salary paid to the 20 graduates.
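Part (b) calls for a weighted mean, because the three company averages are based on different numbers of graduates. The sketch below works through both parts with the standard library; the variable names are illustrative, and the unweighted mean is included only for contrast.

```python
import statistics

# Part (a): the 25 values listed in the question.
data = [1] * 5 + [2] * 6 + [3] * 5 + [4] * 4 + [5] * 5
mean_a = statistics.mean(data)
median_a = statistics.median(data)
mode_a = statistics.mode(data)

# Part (b): weight each company's average salary by its number of graduates.
salaries = [7000, 6000, 9000]
graduates = [5, 12, 3]
weighted_mean = sum(s * g for s, g in zip(salaries, graduates)) / sum(graduates)
simple_mean = sum(salaries) / len(salaries)  # ignores group sizes, shown for contrast

print(mean_a, median_a, mode_a)
print(weighted_mean, round(simple_mean, 2))
```

The gap between `weighted_mean` and `simple_mean` shows how much the largest company pulls the overall figure towards its own average.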
22 16 26 33 33 37 9 23 32 17
20 13 12 18 19 10 21 22 25 22
22 22 34 24 23 21 38 31 41 20
Table 2.9
(a) Plot the histogram and visually comment on the shape of the weekly
expenditure. Hint: use class width of 5.
(b) Calculate the values of the mean and median.
(c) Use descriptive statistics in conjunction with the histogram to comment on weekly
expenditure.
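The frequency distribution behind the histogram in part (a) can be tabulated before plotting. The sketch below assumes the classes start at multiples of 5 (5-9, 10-14, and so on), which the hint does not specify, and prints a rough text histogram instead of using a plotting library.

```python
import statistics
from collections import Counter

# Values transcribed from Table 2.9.
expenditure = [
    22, 16, 26, 33, 33, 37, 9, 23, 32, 17,
    20, 13, 12, 18, 19, 10, 21, 22, 25, 22,
    22, 22, 34, 24, 23, 21, 38, 31, 41, 20,
]

WIDTH = 5  # class width from the hint

# Map each value to the lower limit of its class, e.g. 22 -> 20 (class 20-24).
# Assumption: classes start at multiples of WIDTH.
counts = Counter((value // WIDTH) * WIDTH for value in expenditure)

for lower in range(min(counts), max(counts) + WIDTH, WIDTH):
    freq = counts.get(lower, 0)
    print(f"{lower:2d}-{lower + WIDTH - 1:2d} | {'*' * freq} ({freq})")

mean_value = statistics.mean(expenditure)
median_value = statistics.median(expenditure)
print(f"mean = {mean_value:.2f}, median = {median_value}")
```

Comparing the mean with the median, alongside the shape of the bars, gives a first indication of any skew for part (c).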
a. Find the mean length of this sample by hand and by using a spreadsheet.
b. Construct the cumulative frequency graph and use this to estimate the median.
c. Check the value of the median using the formula method.
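For parts (a) and (c), the grouped mean and the median by the formula method can be checked with a short script. This is a sketch under two assumptions: the six classes are those in the Length/Frequency table that accompanies this question, and the gap of 0.1 between adjacent classes puts the true class boundaries at plus or minus 0.05 of the stated limits. The function name `grouped_median` is illustrative.

```python
# (lower limit, upper limit, frequency) as listed in the Length/Frequency table.
classes = [
    (4.0, 4.2, 4),
    (4.3, 4.5, 9),
    (4.6, 4.8, 13),
    (4.9, 5.1, 20),
    (5.2, 5.4, 34),
    (5.5, 5.7, 18),
]

n = sum(f for _, _, f in classes)

# Grouped mean: sum of (class midpoint x frequency) divided by n.
grouped_mean = sum((lo + hi) / 2 * f for lo, hi, f in classes) / n

def grouped_median(classes, gap=0.1):
    """Median by interpolation: L + ((n/2 - F) / f) * c."""
    total = sum(f for _, _, f in classes)
    cumulative = 0  # F: cumulative frequency below the current class
    for lo, hi, f in classes:
        if cumulative + f >= total / 2:
            lower_boundary = lo - gap / 2              # L, e.g. 5.2 -> 5.15
            width = (hi + gap / 2) - lower_boundary    # c, the true class width
            return lower_boundary + (total / 2 - cumulative) / f * width
        cumulative += f
    raise ValueError("empty frequency table")

median_estimate = grouped_median(classes)
print(f"n = {n}, mean = {grouped_mean:.3f}, median = {median_estimate:.3f}")
```

The value read from the cumulative frequency graph in part (b) should be close to `median_estimate`, since both rest on linear interpolation within the median class.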
Length Frequency
4.0 - 4.2 4
4.3 - 4.5 9
4.6 - 4.8 13
4.9 - 5.1 20
5.2 - 5.4 34
5.5 - 5.7 18
15. Greendelivery.com has recently decided to review the weekly mileage of the
delivery vehicles used to deliver shopping purchased online to customer homes from
a central parcel depot. The sample data collected is part of the first stage in analysing
the economic benefit of potentially moving all vehicles to bio-fuels from diesel.
(a) Use Excel to construct a frequency distribution and plot the histogram with
class intervals of 10 and classes 75-84, 85-94, ..., 175-184. Comment on the
pattern in mileage travelled by the company vehicles.
(b) Use the raw data to determine the mean, median, standard deviation, and
semi-interquartile range.
(c) Comment on which measure you would use to describe the average and
measure of dispersion. Explain using your answers to (a) and (b).
(d) Calculate the measure of skewness and kurtosis and comment on the
distribution shape.
16. The manager at BIG JIMS restaurant is concerned about the time it takes counter staff to process
credit card payments. The manager has collected the following
processing time data (time in minutes/seconds) (Table 2.21) and requested that summary
statistics be calculated.
Plant A Plant B
6.72 10.13 9.31 7.83 9.93 8.10 6.27 8.54
9.83 7.38 9.36 9.23 10.36 7.81 9.69 8.51
7.15 6.93 7.23 8.70 9.06 7.58 8.01 9.54
7.72 9.32 8.32 10.65 8.08 8.35 7.78 9.08
9.20 8.70 9.32 8.09 9.82 6.51 8.33 7.01
11.36 8.50 8.86 10.06 9.56 7.98 8.94 7.06
6.38 7.99 9.34 6.62 7.81 6.62 9.82 9.26
9.57 7.23 8.91 10.74 7.27 8.14 9.45 10.26
Table 6.2
(a) For the given samples, conduct an appropriate hypothesis test of whether the sample mean
values differ at the 5% level of significance.
(b) If the sample means are not significantly different, test whether the population mean is 8.3
days (choose sample A to undertake the test).
20. The Indian restaurant manager has employed two new delivery drivers and wishes to
assess their performance. The data in Table 6.3 represent the delivery times for person A
and B undertaken on the same day.
Person A Person B
32.9 25.6 36.2 34.6 30.3 31.6 25.5 36.5 36.0 36.3
29.4 33.5 32.5 40.7 32.7 25.5 28.1 38.8 32.4 32.8
41.2 35.6 40.8 32.4 35.3 34.2 37.5 33.3 25.9 37.7
40.3 34.6 30.2 37.1 31.0 33.4 32.3 33.2
39.3 36.5 35.0 32.7 35.5 32.6 31.9 36.8
30.3 35.7 40.2 34.2 36.5 34.0 35.9 25.1
37.5 38.0 33.4 33.2 36.1 41.4 29.0 37.6
45.0 30.7 37.8 37.7 28.9 29.8 34.3 34.4
Based upon your analysis of the two samples, is there any evidence that the delivery times
are different (test at 5%)?
21. A tyre manufacturer conducts quality assurance checks on the tyres that it manufactures. One of
the tests consists of undertaking a test on their medium quality tyres with an independent
random sample of 12 tyres providing a sample mean and standard deviation of 14,500 km and
800 km respectively. Given that the historical average is 15,000 km and that the population is
normally distributed, test whether the sample would raise a cause for concern.
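With a small sample and only the sample standard deviation available, a one-sample t test is a natural choice here. The sketch below computes the test statistic from the summary figures in the question. The 5% significance level is an assumption, since the question does not state one, and the critical values are copied from standard t tables for 11 degrees of freedom.

```python
import math

n = 12                # sample size
sample_mean = 14_500  # km
sample_sd = 800       # km
mu0 = 15_000          # historical average, km

standard_error = sample_sd / math.sqrt(n)
t_stat = (sample_mean - mu0) / standard_error
df = n - 1

# Critical values for df = 11 from standard t tables (5% level is an assumption).
T_CRIT_ONE_TAILED_5PC = 1.796
T_CRIT_TWO_TAILED_5PC = 2.201

reject_one_tailed = t_stat < -T_CRIT_ONE_TAILED_5PC   # H1: mean is below 15,000 km
reject_two_tailed = abs(t_stat) > T_CRIT_TWO_TAILED_5PC  # H1: mean differs from 15,000 km

print(f"t = {t_stat:.3f}, df = {df}")
print("one-tailed reject:", reject_one_tailed, "| two-tailed reject:", reject_two_tailed)
```

With these figures the two versions of the test disagree at 5%, so the answer should state clearly which alternative hypothesis is being used and why.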
22. A new low-fat fudge bar is advertised as having 120 calories. The manufacturing company
conducts regular checks by selecting independent random samples and testing the sample
average against the advertised average. Historically the population varies as a normal
distribution and the most recent sample consists of the numbers: 99, 132, 125, 92, 108, 127,
105, 112, 102, 112, 129, 112, 111, 102, and 122. Is the population value significantly different
from 120 calories (significance level 5%)?
23. During a national election a national newspaper wanted to assess whether there was a
similar voting pattern for a particular party between two towns in the north-east of
England. The sample results are illustrated in Table 6.4.
Town A Town B
Number interviewed, N 456 345
Intention to vote for party, n 243 212
Table 6.4
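One way to compare the two towns is a two-sample z test for proportions with a pooled estimate under the null hypothesis of equal proportions. The sketch below uses only the standard library. The function name `two_proportion_z` is illustrative, and a two-sided alternative is assumed because the question asks only whether the voting patterns are similar.

```python
import math
from statistics import NormalDist

def two_proportion_z(x1, n1, x2, n2):
    """Two-sided z test for equal proportions using the pooled estimate."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Town A: 243 of 456; Town B: 212 of 345 (Table 6.4).
z, p_value = two_proportion_z(243, 456, 212, 345)
print(f"z = {z:.3f}, two-sided p = {p_value:.4f}")
```

The same function can be applied to any pair of counts of the form "n out of N", provided both samples are large enough for the normal approximation to be reasonable.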
                                          Airport A   Airport B
Total number of items processed, N        15596       25789
Number of items of luggage misplaced, n   123         167
Table 6.5
Assess whether there is a significant difference in misplaced luggage between the
two airports (test at 5%).
25. A university finance department would like to compare the travel expenses claimed by staff
attending conferences. After initial data analysis the finance director has identified two
departments who seem to have very different levels of claims. Based upon the data
provided (Table 6.7), undertake a suitable test to assess whether the level of claims from
department A is significantly greater than that from department B. You can assume that the
population expenses data are normally distributed and that the population standard
deviations are approximately equal.
Department A Department B
156.67 146.81 147.28 140.67 108.21 109.10 127.16
169.81 143.69 157.58 154.78 142.68 110.93 101.85
130.74 155.38 179.89 154.86 135.92 132.91 124.94
158.86 170.74
Table 6.7
26. A university finance department would like to compare the travel expenses claimed by staff
attending conferences. After initial data analysis the finance director has identified two
departments who seem to have very different levels of claims. Based upon the data
provided (Table 6.7), undertake a suitable test to assess whether the level of claims from
department A is significantly greater than that from department B. You can assume that the
population expenses data are normally distributed and that the population standard
deviations are approximately equal.
Department A Department B
156.67 146.81 147.28 140.67 108.21 109.10 127.16
169.81 143.69 157.58 154.78 142.68 110.93 101.85
130.74 155.38 179.89 154.86 135.92 132.91 124.94
158.86 170.74
For this question, assume instead that the population variances are unequal. Are the expenses
claimed by department A significantly different from those claimed by department B?
27. Choko Ltd provides training to its salespeople to aid the ability of each salesperson to
increase the value of their sales. During the last training session 15 salespeople attended
and their weekly sales before and sales after are provided in Table 6.8.
Assuming that the populations are normally distributed, assess whether there is any evidence
that the training improves sales (test at 5% and 1%).
28. Concern has been raised at the standard achieved by students completing final year project
reports within a university department. One of the factors identified as important is the
research methods (RM) module mark achieved, which is studied before the students start their
project. The department has now collected data for 15 students as given in Table 6.9.
Student RM Project
1 38 71
2 50 46
3 51 56
4 75 44
5 58 62
6 42 65
7 54 50
8 39 51
9 48 43
10 14 62
11 38 66
12 47 75
13 58 60
14 53 75
15 66 63
Table 6.9
Assuming that the populations are normally distributed, is there any evidence to suggest that the
marks are different (test at 5%)?
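Because each student contributes both an RM mark and a project mark, the observations are paired, and a paired t test on the differences is one suitable approach. The sketch below transcribes Table 6.9 and computes the statistic with the standard library. The two-tailed critical value for 14 degrees of freedom is copied from standard t tables.

```python
import math
import statistics

# Marks transcribed from Table 6.9, students 1 to 15.
rm = [38, 50, 51, 75, 58, 42, 54, 39, 48, 14, 38, 47, 58, 53, 66]
project = [71, 46, 56, 44, 62, 65, 50, 51, 43, 62, 66, 75, 60, 75, 63]

diffs = [r - p for r, p in zip(rm, project)]   # RM minus project, per student
mean_diff = statistics.mean(diffs)
sd_diff = statistics.stdev(diffs)              # sample sd (n - 1 denominator)
t_stat = mean_diff / (sd_diff / math.sqrt(len(diffs)))
df = len(diffs) - 1

T_CRIT_TWO_TAILED_5PC = 2.145  # df = 14, from standard t tables
significant = abs(t_stat) > T_CRIT_TWO_TAILED_5PC

print(f"mean difference = {mean_diff:.2f}, t = {t_stat:.3f}, df = {df}")
print("significant at 5% (two-tailed):", significant)
```

Working with the per-student differences removes the variation between students, which is why the paired test is preferred to an independent two-sample test for this design.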
29. A university finance department would like to compare the travel expenses claimed by staff
attending conferences. After initial data analysis the finance director has identified two
departments who seem to have very different levels of claims. Based upon the data
provided (Table 6.7), undertake a suitable test to assess whether the level of claims from
department A is significantly greater than that from department B. You can assume that the
population expenses data are normally distributed and that the population standard
deviations are approximately equal.
Department A Department B
156.67 146.81 147.28 140.67 108.21 109.10 127.16
169.81 143.69 157.58 154.78 142.68 110.93 101.85
130.74 155.38 179.89 154.86 135.92 132.91 124.94
158.86 170.74
We assumed that the two population variances are equal. Conduct an appropriate test
to check whether the variances are equal (test at 5%).
30. An estate agent is interested in developing a model to predict the house sales price based
upon two other variables: size of property and age. His initial analysis suggests a
multiple regression model would be appropriate, with the relationship between the
dependent and independent variables being linear. Table 8.21 presents the data set.
31. Fit an appropriate equation to the data set (Table 8.15) to predict the examination mark
given the assignment mark for 14 undergraduate students.
a. Construct a scatter plot and comment upon the possible relationship between the two
variables.
b. Calculate the product moment correlation coefficient between vehicle numbers and road
deaths.
c. Use your answers to (a) and (b) to comment upon your results.
33. A sample of students’ essays was marked independently by two tutors. The resulting ranks
are shown in Table 8.7.
Tutor A   5 8 1 6 2 7 3 4
Tutor B   7 4 3 1 6 8 5 2
a. Calculate the rank correlation coefficient.
b. State any conclusions that you can draw.
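Since the data are already ranks with no ties, the rank correlation coefficient in part (a) follows directly from the squared rank differences. A minimal sketch, with illustrative names:

```python
# Ranks given by each tutor to the same eight essays (Table 8.7).
tutor_a = [5, 8, 1, 6, 2, 7, 3, 4]
tutor_b = [7, 4, 3, 1, 6, 8, 5, 2]

def spearman_from_ranks(rank_x, rank_y):
    """r_s = 1 - 6 * sum(d^2) / (n * (n^2 - 1)); valid when there are no tied ranks."""
    n = len(rank_x)
    sum_d2 = sum((a - b) ** 2 for a, b in zip(rank_x, rank_y))
    return 1 - 6 * sum_d2 / (n * (n ** 2 - 1))

r_s = spearman_from_ranks(tutor_a, tutor_b)
print(f"r_s = {r_s:.3f}")
```

A coefficient close to zero would suggest little agreement between the two tutors' rankings, which is the starting point for part (b).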
34. The mathematics and statistics examination marks for a group of ten students are shown
in Table 8.8.
Mathematics 89 73 57 53 51 49 47 44 42 38
Statistics 51 53 49 50 48 21 46 19 43 43
Table 8.8
(a) Find the product moment correlation coefficient for the two sets of marks.
(b) Place the marks in rank order and calculate the rank correlation coefficient.
(c) The following is a quotation from a statistics text ‘Rank correlation can be used to
give a quick approximation to the product moment correlation coefficient.’
Comment on this in the light of your results.
35. A teacher of 40 university students studying the application of Excel within a business
context is concerned that students are not taking a group work assignment seriously. This is
deemed to be important given that the group work element is contributing to the
development of personal development skills. To assess whether or not this is a problem, the
module tutor devises a simple experiment that judges the level of cooperation shown
by each individual student within their own group. In the experiment a rating scale is
employed to measure the level of cooperation: 1 = limited cooperation, 5 = moderate
cooperation and 10 = complete cooperation. The form of the testing consists of an initial
observation, a lecture on working in groups, and a final observation. Given the raw data in
Table 7.18 conduct a relevant test to assess whether or not we can observe that cooperation
has changed significantly (assess at 5%).
5, 8 4, 6 3, 3 6, 5 8, 9 10, 9 8, 8 4, 8 5, 5 8, 9
3, 5 5, 4 6, 5 4, 4 7, 8 7, 9 9, 9 8, 7 5, 8 5, 6
8, 7 8, 8 3, 4 5, 6 6, 7 4, 8 7, 8 9, 10 10, 10 8, 9
8, 8 4, 6 4, 5 7, 8 5, 7 7, 9 8, 10 3, 6 5, 6 7, 8
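The ratings are ordinal and paired, so a sign test is one relevant non-parametric option (the Wilcoxon signed-rank test is another). The sketch below assumes each pair is written as (initial observation, final observation), which the data listing does not state explicitly. It drops tied pairs and computes an exact two-sided binomial p-value.

```python
from math import comb

# Assumption: each pair is (initial rating, final rating) for one student.
pairs = [
    (5, 8), (4, 6), (3, 3), (6, 5), (8, 9), (10, 9), (8, 8), (4, 8), (5, 5), (8, 9),
    (3, 5), (5, 4), (6, 5), (4, 4), (7, 8), (7, 9), (9, 9), (8, 7), (5, 8), (5, 6),
    (8, 7), (8, 8), (3, 4), (5, 6), (6, 7), (4, 8), (7, 8), (9, 10), (10, 10), (8, 9),
    (8, 8), (4, 6), (4, 5), (7, 8), (5, 7), (7, 9), (8, 10), (3, 6), (5, 6), (7, 8),
]

increases = sum(1 for before, after in pairs if after > before)
decreases = sum(1 for before, after in pairs if after < before)
ties = len(pairs) - increases - decreases

# Under H0 (no change), non-tied signs follow Binomial(n, 0.5).
n = increases + decreases
k = min(increases, decreases)
tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
p_value = min(1.0, 2 * tail)  # two-sided

print(f"increases = {increases}, decreases = {decreases}, ties = {ties}")
print(f"two-sided p = {p_value:.5f}")
```

The sign test uses only the direction of each change. If the sizes of the changes are considered meaningful, the Wilcoxon signed-rank test makes fuller use of the data.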
36. A company is planning to introduce new packaging for a product that has used the same
packaging for over 20 years. Before it makes a decision on the new packaging it decides to
ask a panel of 20 participants to rate the current and proposed packaging (using a rating
scale of do not change 0 – change 100) (Table 7.20). Is there any evidence that the new
packaging is more favourably received compared with the older packaging (assess at 5%)?
Machinist
1 2 3 4 5 6 7 8 9 10
Before 49 34 30 46 37 28 48 40 42 45
After 22 23 32 24 23 21 24 29 27 27
11 12 13 14 15 16 17 18 19 20
Before 29 45 32 44 49 28 44 39 47 41
After 23 29 37 22 33 27 35 32 35 24
21 22 23 24 25 26 27 28 29 30
Before 33 38 35 35 47 47 48 35 41 35
After 37 37 24 23 23 37 38 30 29 31
Table 7.21
38. An agricultural officer wants to study the effect of 4 different fertilizers on a specific crop. The
corresponding data are as shown below. Check whether there is a significant difference between the yields
for the different fertilizers using the Kruskal-Wallis test at the 0.01 level of significance. The table value is 11.345.
39. Check whether there is any significant difference between three hospitals at 5% level of
significance.
A B C
87 45 66
55 60 50
72 65 55
76 50 88
48 55 72
67 78 65
65 68 84
58 57 77
66 54 48
71 80 56
68 53 54
73 45 72
86 68 60
78 78 56
68 59 53
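The question does not name a test. Assuming a Kruskal-Wallis test is intended, as in the preceding question, the H statistic can be computed from pooled ranks as sketched below. Ties receive average ranks and no tie correction is applied, so H will be slightly conservative. The function names are illustrative, and the critical value is the standard chi-square table value for 2 degrees of freedom at 5%.

```python
# Columns A, B and C transcribed from the table above.
hospital_a = [87, 55, 72, 76, 48, 67, 65, 58, 66, 71, 68, 73, 86, 78, 68]
hospital_b = [45, 60, 65, 50, 55, 78, 68, 57, 54, 80, 53, 45, 68, 78, 59]
hospital_c = [66, 50, 55, 88, 72, 65, 84, 77, 48, 56, 54, 72, 60, 56, 53]

def average_ranks(values):
    """Map each distinct value to its rank, averaging ranks across ties."""
    ordered = sorted(values)
    ranks = {}
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and ordered[j] == ordered[i]:
            j += 1
        # 0-based positions i..j-1 share a value; their 1-based ranks are i+1..j.
        ranks[ordered[i]] = (i + 1 + j) / 2
        i = j
    return ranks

def kruskal_wallis_h(*groups):
    """H = 12 / (N(N+1)) * sum(R_i^2 / n_i) - 3(N+1), without tie correction."""
    pooled = [v for g in groups for v in g]
    n_total = len(pooled)
    ranks = average_ranks(pooled)
    rank_sums = [sum(ranks[v] for v in g) for g in groups]
    h = 12 / (n_total * (n_total + 1)) * sum(
        r ** 2 / len(g) for r, g in zip(rank_sums, groups)
    ) - 3 * (n_total + 1)
    return h, rank_sums

h_stat, rank_sums = kruskal_wallis_h(hospital_a, hospital_b, hospital_c)
CHI2_CRIT_5PC_DF2 = 5.991  # chi-square table value, df = 3 groups - 1

print(f"H = {h_stat:.3f}, rank sums = {rank_sums}")
print("significant at 5%:", h_stat > CHI2_CRIT_5PC_DF2)
```

If the data can be assumed normal with equal variances, a one-way analysis of variance would be the parametric alternative.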
40. The sale of new homes is tied closely to the level of confidence within the financial markets. A
developer builds new homes in two European countries (A and B) and is concerned that there
is a direct relationship between the country and the interest rates obtainable to build
properties. To provide answers the developer decides to undertake market research to see
what interest rates would be obtainable if he decided to borrow €300,000 over 20 years from 5
financial institutions in country A and 8 financial institutions in country B. Based upon the
data in Table 7.22 do we have any evidence to suggest that the interest rates are significantly
different?
A: 10.20 10.97 10.63 10.70 10.50 10.30 10.65
10.25 10.75 11.00
B: 10.60 10.80 11.40 10.90 11.10 11.20 10.89
10.78 11.05 11.15 10.85 11.16 11.18
Table 7.22
41. Petrol prices during the summer of 2008 raised concerns among new car sellers that
potential customers were taking prices into account when choosing a new car. To provide
evidence to test this possibility, a group of five local car showrooms agreed to ask fleet
managers and individual customers during August 2008 whether or not they were
influenced by petrol prices. The results are shown in Table 7.9.
42. A business analyst has been asked to confirm the effectiveness of a marketing campaign on
people’s attitudes to global warming. To confirm that the campaign was effective a group of
500 people were randomly selected from the population and asked the simple question about
whether they agree that national governments should be concerned with an answer of ‘Yes’
or ‘No’. The results are as shown in Table 7.10.