Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
50% found this document useful (2 votes)
361 views

Sma2217 Tutorial 2

This document provides instructions and problems for a statistics tutorial worksheet. It includes 14 problems covering a range of statistical concepts like confidence intervals, hypothesis testing, correlation, and regression analysis. The problems use data from quality control, exam scores, packaging machines, student assessments, repair costs, blood tests, tax returns, task completion times, sugar weights, coin flipping, employee training times, process control, sales data, and exam preparation.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
50% found this document useful (2 votes)
361 views

Sma2217 Tutorial 2

This document provides instructions and problems for a statistics tutorial worksheet. It includes 14 problems covering a range of statistical concepts like confidence intervals, hypothesis testing, correlation, and regression analysis. The problems use data from quality control, exam scores, packaging machines, student assessments, repair costs, blood tests, tax returns, task completion times, sugar weights, coin flipping, employee training times, process control, sales data, and exam preparation.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

SMA2217 Tutorial Worksheet 2

Instruction: Attempt these problems before your tutorial class.


1. A quality control manager takes a random sample of 100 packets of biscuits from a
production line in order to check the mean weight of the whole production. The net
weights he found are tabulated below:

Weight in grams Frequency


Less than 247 0
247 and less than 248 4
248 and less than 249 21
249 and less than 250 40
250 and less than 251 27
251 and less than 252 7
252 and less than 253 1
Over 253 0

a) Estimate the mean and standard deviation of this production.


b) Using the results from (a), calculate a 95% confidence interval for the mean of the whole
production. Is this satisfactory for the filling of ‘250 g’ packets? Explain.

2. After an initial one month training course, new supervisors at Company X sit a
comprehensive examination. The marks are considered to be normally distributed with a
mean mark of 82 and a standard deviation of 5.2. The minimum pass mark is 75. Company X
believes that the mean mark of the new supervisors might be altering. A random sample of
20 was taken from them and analysed giving a mean mark of 77. Assuming the standard
deviation remains unchanged, calculate a 95% confidence interval and state, with a reason,
whether these results support Company X's claim or not.

3. A source of interest to your management is whether the net weights of boxes of biscuits
coming from three packaging machines are the same. A random sample was taken from each
with the following results: (weights in grams)

Machine 249 257 252 256 253 257 258 253 248
A
Machine 251 245 247 250 248 246 250 250 246
B
Machine 250 254 251 248 257 256 248 250 251
C

a) Calculate a 95% confidence interval for the mean net weight of all the packets coming
from machine A.
b) You are given that the confidence interval for the mean net weight of all the packets coming
from machine B is: 286.4 to 249.8 g
i) Does this interval support the claim that the mean net weight is 250g? Explain.
ii) Does this interval support the claim that machine A and B produce equal net
weights? Explain.

c) Carry out a t-test, at 5% significance, to test the claim that machine C is producing a
mean net weight of 250 g.

d) The overall weight of the production is normally distributed with a mean of 250.9g.
Bearing this in mind, write a short report on the production coming from the three
machines.

4. Practical and theory marks for a random sample of 10 engineering students were as follows:

Student A B C D E F G H I J
Practical 63 83 54 63 71 45 68 68 62 52
Theory 50 75 60 56 63 32 55 69 67 43

a) Calculate a 95% confidence interval for each type of assessment separately.

b) Use these two intervals to investigate whether there is a difference between the mean
marks produced by the different types of assessment.

c) Calculate a confidence interval for the mean difference between the marks for each
student and interpret the result.

d) Carry out an appropriate t-test in order to test the hypothesis, at 5%, that both types of
assessment produce the same marks, on average.

e) Comment briefly on the findings from all the previous parts.

5. The management of a large national chain of motels decided to estimate the mean cost
per room of repairing damages made by its customers during a holiday weekend. A random
sample of 150 vacated rooms was inspected by the management. Its analysis estimated the
mean repair cost to be $28.10 and the sample standard deviation to be $12.40. Construct a
95% confidence interval for the mean repair cost, µ, of all its rooms.

6. Repeated assessments on a chemical determination of human blood during a laboratory


analysis are known to be normally distributed. Ten assessments on a given sample of blood
yielded the values:
1.00 0.95 1.01 1.00 1.04 0.96 1.05 1.02 1.01 1.02
Find a 99% confidence interval for the true chemical determination in the blood for repeated
assessments of the sample.

7. A survey of 672 audited tax returns showed that 448 resulted in additional payments.
Construct a 95% confidence interval for the true percentage of all audited tax returns that
result in additional payments.

8. Firm A claims that it pays its clerical staff on average more per week than its rival,
Firm B. Firm B disputes the claim so each examine a random sample of the salaries paid to
their workers with the following results:
Firm A Mean $343.00 Standard deviation $13.20 n = 40
Firm B Mean $338.50 Standard deviation $14.30 n = 50
Test an appropriate hypothesis at 1 % level of significance.

8. The same eleven workers performed a task using two different methods. The completion
times, in minutes, for each task is given below:
Worker 1 2 3 4 5 6 7 8 9 10 11
Method A 15.2 14.6 14.2 15.6 14.9 15.2 15.6 15.0 16.2 15.7 15.6

Method B 14.5 14.8 13.8 15.6 15.3 14.3 15.5 15.0 15.6 15.2 14.8

Construct a 99% confidence interval for the average of the time differences and interpret your
result.

9. Eleven cartons of sugar, each nominally containing 1 kg, were randomly selected from a
large batch of cartons. The weights of sugar they contained were:
1.02 1.05 1.08 1.03 1.00 1.06 1.08 1.01 1.04 1.07 1.00
Does this support the hypothesis, at 5%, that the mean weight for the whole batch is over
1.00kg?

10. A coin is suspected of being biased. It is tossed 200 times and 114 heads occur. Carry out
a
hypothesis test to see if the coin is indeed biased at 1% significance.

11. The times taken to complete a task were recorded for a sample of eight employees both
before and after a period of training: (min)
Employee 1 2 3 4 5 6 7 8
Before training 15 14 18 14 15 17 13 12
After training 13 15 15 13 13 16 14 12

Test at 5% to see if the training is effective or not.


12. A jar filling process is known to have an associated standard deviation of  = 6g net
weight. Suppose that with a declared (label) weight on each jar of 135g, process engineers
have set a target mean net weight at 135 + 3  = 139.8g Suppose further that in a routine
check of flling process performance, intended to detect any changes of the process mean from
its target value, a sample of 25 jars produced a sample mean fill weight of 139.0g. What does
this value of x have to say about the plausibility of the current process mean actually being a
the target mean of 139.8g? Is there evidence that we are at less than the target mean? Use a
formal testing procedure with a significance level of 0.05.
13. A sample of size n = 100 produced the sample mean of X = 16. Assuming the population
standard deviation  = 3, compute a 95% confidence interval for the population mean
14. Assuming the population standard deviation  = 3, how large should a sample be to
estimate the population mean X with a margin of error not exceeding 0.5?

15. The sales director of your company would like to investigate how the Annual sales relates to
the Sales floor area of a store. Data has been collected from a random sample of twelve of their
stores for the year 2013.

Sales floor area (m2) Annual sales ($00 000)


180 23.6
140 23.2
300 33.0
260 31.4
280 33.0
140 26.0
220 27.6
280 26.4
200 30.0
350 40.0
250 34.0
300 36.0
x = 2900 y = 364.2 x2 = 749 400 y2 = 11 345.88 xy = 91204

a) Plot a scatter diagram of the data.

b) Calculate the value of the correlation coefficient and test it for significance at the 5% level.

c) Calculate the regression equation for predicting Annual sales from Sales floor area. and plot the
line on your scatter diagram. Interpret the value of the gradient of this line.

d) Calculate the goodness of fit statistic and interpret its value.

e) Use the equation found in part (c) to predict the Annual sales for a store whose Sales floor area is
320 m2.
16. A trainee manager wondered whether the length of time his trainees revised for an
examination had any effect on the marks they scored in the examination. Before the exam, he
asked a random sample of them to honestly estimate how long, to the nearest hour, they had
spent revising. After the examination he investigated the relationship between the two
variables.

Trainee A B C D E F G H I J
Revision time 4 9 10 14 4 7 12 22 1 17

Exam mark 31 58 65 73 37 44 60 91 21 84

a) Plot the scatter diagram in order to inspect the data.


b) Calculate the correlation coefficient and test it for significance. Comment on your
answer.
c) Calculate the regression model to fit the data.
d) Plot the regression line on the scatter diagram.
f) Predict the examination mark for a trainee who revises for 15 hours.
g) Predict the examination mark for a trainee who revises for 35 hours.
h) Do you have any reservations about your answer to (g)?

17. The output (in thousands) and profits per unit of output is given in the following table:
Output (X Profit t per unit
thousands) of output (Y)
5 1.7
7 2.4
9 2.8
11 3.4
13 3.7
15 4.4

i) Draw the scatter diagram of Y on X.


ii) Calculate b and a and draw the resultant line on your scatter diagram.
iii) Use your regression equation to estimate the profit per unit of output when the output is
10,000.
iv) Calculate the correlation coefficient r and explain its meaning.
v) Evaluate the model fit

18. Four laboratories, A, B, C and D, are used by food manufacturing companies for making
nutrition analyses of their products. The following data are the fat contents (in grams) of the
same weight of three similar types of peanut butter.
Laboratory

Peanut Butter A B C D

Brand 1 16.6 17.7 16.0 16.3

Brand 2 16.0 15.5 15.6 15.9

Brand 3 16.4 16.3 15.9 16.2

Analyse the data at 5% significance by carrying out a one-way ANOVA to see if there is a
difference between the fat content of the three brands if laboratory is not taken into account.

19. Water samples were taken at three different locations in a river to determine whether the
quantity of dissolved oxygen, a measure of water pollution, varied from one location to
another. Location one was selected above an industrial plant; location two was adjacent to the
industrial water discharge for the plant; and location three was slightly downriver in
midstream. Five water specimens were randomly selected at each location. The data are
shown in the accompanying table (The greater the pollution, the lower will be the dissolved
oxygen readings).

Location 1 5.9 6.1 6.3 6.1 6.0


Location 2 4.8 5.0 4.3 4.7 5.1
Location 3 6.0 6.1 5.8 5.6 5.7

Analyse the data and draw appropriate conclusions..

20. Four salesmen in a company are competing for the title ‘Salesman of the year’. Each has
the task of selling ‘product X’ in three different types of location. Their resulting sales, in
$’000, were as follows:

Salesmen
Area A B C D
1 52.8 49.4 58.6 42.9
2 60.1 48.1 61.0 50.3
3 62.0 56.4 63.3 61.2
Is there any significant difference, at a 5% level, between the sales of the men if the location
is not taken into account?

You might also like