Question Based On Data Set
Question Based On Data Set
QUESTION - A
1. Study the Lungcap data set and answer the following questions.
ii) Given that one randomly selected person is a smoker, what is the
probability that the person is a female?
iii) Are gender and smoking habit independent?
2. Suppose it is given that 20% of the male smokers and 15% of the female
smokers were born caesarean. With the help of the data, verify the above
statements. Give enough reasons for your answers.
3. Plot the histogram of the distribution of Lungcap amongst smokers.
4. Plot the histogram of the distribution of Height amongst smokers.
5. Are height and Lungcap independent?
6. Are the variation of Lungcap of male smokers and female smokers equal?
7. Are the average of Lungcap of smokers and non-smokers equal?
8. Plot the histogram of the age amongst smokers.
9. What percentage of people below 16 years smoke?
10.What percentage of people above 17 years smoke?
11.Test if smoking habit and age are dependent.
12.Test if smoking habit and Lungcap are dependent.
13.Fit a suitable distribution to height and also to Lungcap. Test the
goodness of fit.
QUESTION – B
Study the car data set and answer the following questions.
1. Find the average and variance of price and mileage separately. Comment
on the results. How will you interpret the result statistically?
2. Test if the mean mileage of different car manufacturers within some price
range are equal. Clearly specify all the assumptions and the null and
alternative hypotheses.
3. Find a 90% confidence price range for the Chevrolet cars.
4. Find a 90% confidence for variance of prices for Pontiac cars.
QUESTION – C
1. Let Ý be the mean of a random sample of size n1from N ( μ , σ 2=10). Find n1
such that the probability of the random interval ( Ý −1/2, Ý +1/2) includes
μ is approximately 0.954.
2. Let Ź be the mean of a random sample of size n2 from N ( μ , σ 2=9 ) . Find n2
such that the probability of the random interval ( Ź−1 , Ź +1) includes μ is
approximately 0.90.
3. Draw 200 random samples each of size n1 (found above) from a normal
distribution with mean 5 and variance 3.
4. Write down the distribution of the sample mean. Test using the data
obtained in Q3 above, if the sample means follow that distribution.
5. Draw 200 random samples each of size n2 (found above) from a normal
distribution with mean 7 and variance 3.
Compute 95% confidence interval for the difference of means from each of the
200 samples. Draw a graph to show all 200 confidence intervals and comment
QUESTION – D
1. Collect stock prices for 5 companies from 1st Jan 2016 to 30th June 2016.
2. Plot the histogram of the returns for each company. Describe the
histograms.
3. Test whether the average returns for 5 companies are equal. State clearly
the assumptions required, null and alternative hypotheses.
4. Test whether the average returns for each pair of companies are equal.
5. Comment on the results.