Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
25 views

ITS Assignment 2

Uploaded by

Sugar Cane
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
25 views

ITS Assignment 2

Uploaded by

Sugar Cane
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 8

Assignment 2

Introduction To Statistics
Section 97051/97084
Due Date: 05/10/2024

Question 1: The networks for the top 20 television shows, as determined by the Nielsen
Ratings for the week ending October 26, 2008, are shown in the following table.

CBS ABC CBS ABC ABC

Fox CBS CBS Fox CBS

ABC CBS CBS CBS Fox

Fox Fox CBS Fox ABC

a) Determine the mode of the data.


b) Decide whether it would be appropriate to use either the mean or the median as a
measure of center. Explain your answer.

Question 2: Compute the mean, the median, and the mode for the following data.

Question 3: A small accounting firm pays each of its five clerks $35,000, two junior
accountants $80,000 each, and the firm’s owner $320,000. What is the mean salary paid at
this firm? How many of the employees earn less than the mean? What is the median salary?

For the data given in Questions 4 and 5:

a) Obtain and interpret the quartiles.


b) Determine and interpret the interquartile range.
c) Find and interpret the five-number summary.
d) Identify potential outliers, if any.
e) Construct and interpret a boxplot.

Question 4: The Great Gretzky. Wayne Gretzky, a retired professional hockey player, played
20 seasons in the National Hockey League (NHL), from 1980 through 1999. S. Berry
explored some of Gretzky’s accomplishments in “A Statistician Reads the Sports Pages”
(Chance, Vol. 16, No. 1, pp. 49–54). The following table shows the number of games in
which Gretzky played during each of his 20 seasons in the NHL.
79 80 80 80 74

80 80 79 64 78

73 78 74 45 81

48 80 82 82 70

Question 5: A sample of 14 California counties yielded the following percentages of children


under 18 living with grandparents.

5.9 4.0 5.7 5.1 4.1 4.4 6.5

4.4 5.8 5.1 6.1 4.5 4.9 4.9

Question 6: Compute the mean for the following numbers.

7 -25 90 -3 -6 -7 -4 -5 2 -8

Question 7: Compute the 35th percentile, the 55th percentile, Q1, Q2, and Q3 for the
following data.

16 28 29 13 17 20 11 34 32 27 25 30 19 18 33

Question 8: Compute P20, P47, P83, Q1, Q2, and Q3 for the following data.

Question 9: A local community center is organizing a series of workshops to help residents


improve their digital skills. They are looking into how attendees are spread across different
age groups to better tailor the content of these workshops. After conducting a survey, they
categorized the attendees into different age brackets with the following data:

Calculate 𝑄3, 𝐷7 and 𝑃20 from the following grouped data and interpret each in the context
of age groups and attendees.

Age group Attendees

2-4 3

4-6 4
6-8 2

8 - 10 1
Question 10: In a study of apparently healthy children aged 6 to 60 months in Papua New
Guinea, CRP was measured in 90 children. The units are milligrams per liter (mg/l). Here
are the data from a random sample of 40 of these children:

(a) Find the five-number summary for these data.


(b) Make a boxplot.
(c) Make a histogram.
(d) Write a summary of the major features of this distribution. Do you prefer the
boxplot or the histogram for these data?

Question 11: Consider the following four data sets.

a. Compute the mean of each data set.


b. Although the four data sets have the same means, in what respect are they
quite different?
c. Which data set appears to have the least variation? the greatest variation?
d. Compute the range of each data set.
e. Use the defining formula to compute the sample standard deviation of each
data set.
f. From your answers to parts (d) and (e), which measure of variation better
distinguishes the spread in the four data sets: the range or the standard
deviation? Explain your answer.
g. Compute the coefficient of variation of each data set.
h. Are your answers from parts (c) and (e) consistent?

Question 12: Use your calculator or computer to find the sample variance and sample
standard deviation for the following data.

57 88 68 43 93

63 51 37 77 83

66 60 38 52 28
34 52 60 57 29

92 37 38 17 67

within µ ±𝑘σ for each value of 𝑘 ?


Question 13: According to Chebyshev’s theorem, at least what proportion of the data will be

1. 𝑘 = 2

2. 𝑘 = 2. 5

3. 𝑘 = 1. 6

4. 𝑘 = 3. 2

Question 14: If the mean of a population is 250 and its standard deviation is 20,
approximately what proportion of observations is in the interval between each pair of
values?

1. 190 and 310


2. 210 and 290

Question 15: A set of data is mounded, with a mean of 450 and a variance of 625.
Approximately what proportion of the observations is

1. greater than 425?


2. less than 500?
3. greater than 525?

Question 16: The annual percentage returns on common stocks over a 7-year period were
as follows:

4. 0% 14. 3% 19. 0% − 14. 7% − 26. 5% 37. 2% 23. 8%

Over the same period the annual percentage return on U.S Treasury Bills were as follows

6. 5% 4. 4% 3. 8% 6. 9% 8. 0% 5. 8% 5. 1%

1. Compare the means of these two population distributions.


2. Compare the standard deviations of these two population distributions
3. Compute the coefficient of variation of these two population distributions to check the
consistency.

Question 17: A company produces lightbulbs with a mean lifetime of 1,200 hours and a
standard deviation of 50 hours.

a. Describe the distribution of lifetimes if the shape of the population is unknown.


b. Describe the distribution of lifetimes if the shape of the distribution is known to be
bell-shaped.

Question 18: Consider the company which produces lightbulbs with a mean lifetime of 1,200
hours and a standard deviation of 50 hours.

a. Find the z-score for a lightbulb that lasts only 1,120 hours.
b. Find the z-score for a lightbulb that lasts 1,300 hours.

Question 19: Following is a random sample of seven (x, y) pairs of data points:

a. Compute the covariance.


b. Compute the correlation coefficient.

Question 20: 8 River Hills Hospital is interested in determining the effectiveness of a new
drug for reducing the time required for complete recovery from knee surgery. Complete
recovery is measured by a series of strength tests that compare the treated knee with the
untreated knee. The drug was given in varying amounts to 18 patients over a 6-month
period. For each patient the number of drug units, X, and the days for complete recovery, Y,
are given by the following (x, y) data:

a. Compute the covariance.


b. Compute the correlation coefficient.
c. Briefly discuss the relationship between the number of drug units and the recovery
time. What dosage might we recommend based on this initial analysis?

Question 21: Calculate quartile deviation and coefficient of quartile deviation for continuous
grouped data.
Question 22: Two plants C and D of a factory show the following results about the number
of workers and the wages paid to them.

Average monthly $2500 $2500


wages

Standard deviation 9 10
Using coefficient of variation formulas, find in which plant, C or D is there greater variability
in individual wages.

Question 23: Find which data set given below is more consistent.
Data set 1 1 7 12 15 20 22 28

Data set 2 1 15 15 15 15 16 28

Question 24: Compute all the coefficient of mean deviation and coefficient of variation for the
given data set and compare your results. Check which data set has less delay time.

Minutes of delay Shop A Shop B


---------------- ------ ------
5-10 20 15
10-15 25 20
15-20 30 30
20-25 25 15
25-30 20 10
30-35 10 5

Question 25:
Question 26:

Question 27:

Question 28:

You are working with data from three different departments in a company, each tracking the number
of days it takes employees to complete a project over the last month. The data for each department
is as follows:

Department 1 (Marketing):
12, 15, 14, 10, 13, 16, 11
(Number of days taken to complete projects by employees in the marketing department)

Department 2 (Sales):
22, 25, 20, 19, 23, 21, 18, 30, 17
(Number of days taken to complete projects by employees in the sales department)

Department 3 (Product Development):


8, 12, 9, 5, 11, 7, 4, 16, 2
(Number of days taken to complete projects by employees in the product development department)

1. Construct a boxplot for each department based on the number of days employees took to
complete projects.
2. Compare the three boxplots by discussing:
Median and quartiles (central tendency)
Interquartile range(spread)
Presence of any outliers

Question 29:

Question 30:

Question 31:

You might also like