Lab 8 Activities Solution

Uploaded by

Mansi

Lab #8 Activities

Solution

Instructions:
• Fill in your name in line 3.
• Knit to pdf to read the questions in a more readable format.
• Fill in the code chunks below and answer the questions with text responses. It is recommended that you
knit to pdf after you fill in each code chunk. Be sure when adding in text responses to never copy-paste
symbols from outside of the document.
• If you install any packages, do so in the R console, not in code chunks. The library() function must
appear in a code chunk if you will use a function from a package that is not part of base R.
• Your responses must use code that was covered in class; other methods to solve the problems will not
be accepted.
• Submit your knit pdf file to Crowdmark.
A reminder that the R code we have covered in class is available on our STAT 2150 A01 UM Learn page,
under Content > Course Material.
Your knit pdf file should show the result answering each question. To do this, after creating an R object, you
should also print it on a new line within the code chunk.
Question 1:
Suppose that a book editor finds that the number of typos per 100 pages of a manuscript follows a Poisson
distribution with λ = 3.
(a) Write the R code that calculates the probability of more than 3 typos in a 100-page manuscript.
1- ppois(3,3)

## [1] 0.3527681
# or:
1 - dpois(0,3) - dpois(1,3) - dpois(2,3) - dpois(3,3)

## [1] 0.3527681
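As a cross-check on part (a), the same upper-tail probability can be requested directly from ppois() through its lower.tail argument. This is only an alternative sketch, not one of the solution's methods; the object names p_complement and p_upper are introduced here for illustration.

```r
# P(X > 3) for X ~ Poisson(3), computed two ways.
p_complement = 1 - ppois(3, 3)             # complement of P(X <= 3), as in the solution
p_upper = ppois(3, 3, lower.tail = FALSE)  # upper tail P(X > 3) requested directly
print(p_complement)
print(p_upper)
```

Both calls target the same quantity, so the two printed values should agree with the result shown above.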
(b) Write the R code that generates the number of typos in 20 sets of 100-page manuscripts. Store the
generated data in a vector called data20. Find the proportion of the 20 sets that have more than 3
typos. Do not change the seed in the below code chunk from 123.
set.seed(123)
data20 = rpois(20,3)
length(which(data20 > 3))/20

## [1] 0.4
(c) Repeat part (b), this time generating the number of typos in 100 sets of 100-page manuscripts. Store
the generated data in a vector called data100. Find the proportion of the 100 sets that have more than
3 typos. Again, do not change the seed from 123.

set.seed(123)
data100 = rpois(100,3)
length(which(data100 > 3))/100

## [1] 0.37
(d) Explain how your results in parts (b) and (c) relate to your result from part (a).
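In part (b), 0.4 of the 20 simulated sets have more than 3 typos; in part (c), 0.37 of the 100 simulated sets do.
The true probability from part (a) is about 0.3528, so the proportion from the larger sample is closer to it.
In general, the sample proportion tends to approach the true probability as the sample size increases.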
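The relationship described in part (d) can be explored with a short simulation over several sample sizes. This is a minimal sketch: the sample sizes and the object names (sizes, props, result) are chosen here for illustration and are not part of the lab.

```r
# Sketch: proportion of simulated sets with more than 3 typos, for growing sample sizes.
set.seed(123)
true_prob = 1 - ppois(3, 3)        # exact P(X > 3) from part (a)
sizes = c(20, 100, 1000, 100000)   # illustrative sample sizes (assumption)
props = numeric(length(sizes))
for (i in seq_along(sizes)) {
  sim = rpois(sizes[i], 3)         # simulate sizes[i] sets of 100 pages
  props[i] = mean(sim > 3)         # proportion with more than 3 typos
}
result = data.frame(n = sizes, proportion = props,
                    abs_error = abs(props - true_prob))
print(result)
```

The absolute error does not have to shrink at every step, since each proportion is random, but it is expected to be small for the largest sample size.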
(e) We would like to determine the number of typos per 100 pages such that 75% of the time, the number
of typos is less than or equal to this number. Write the R code that determines this number for the
population of all 100-page manuscripts.
qpois(0.75,3)

## [1] 4
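To see why qpois() returns 4 in part (e), it may help to look at the cumulative probabilities directly: the quantile is the smallest value whose cumulative probability reaches 0.75. The sketch below assumes that a search over 0 to 10 is wide enough for λ = 3; the names k, cdf, and smallest_k are illustrative.

```r
# Sketch: the 0.75 quantile is the smallest k with P(X <= k) >= 0.75.
k = 0:10                           # candidate values (range chosen for illustration)
cdf = ppois(k, 3)                  # cumulative probabilities P(X <= k)
print(data.frame(k = k, cdf = cdf))
smallest_k = min(k[cdf >= 0.75])   # first value that reaches 75%
print(smallest_k)
```

Since P(X ≤ 3) is below 0.75 and P(X ≤ 4) is above it, the search lands on the same value as qpois(0.75,3).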
(f) Write the R code that finds the data value x in the data20 vector such that 75% of the values in the
vector are less than or equal to x. Repeat with the data100 vector.
sorted_data20 = sort(data20)
sorted_data20[15] # 15 is 75% of 20

## [1] 5
# or:
quantile(data20,0.75)

## 75%
## 5
sorted_data100 = sort(data100)
sorted_data100[75] # 75 is 75% of 100

## [1] 4
# or:
quantile(data100,0.75)

## 75%
## 4
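The two approaches in part (f) agree here, but they are not identical in general: by default quantile() interpolates between neighbouring sorted values (type 7), while indexing the sorted vector corresponds to the inverse empirical CDF (type 1). The toy vector below is an assumption made for illustration and is not data from the lab.

```r
# Sketch: sorted-index approach versus quantile() types on a toy vector.
x = c(1, 2, 3, 4)                        # toy data (assumption)
k = ceiling(0.75 * length(x))            # position of the 75% point: 3
by_index = sort(x)[k]                    # 3
by_type1 = quantile(x, 0.75, type = 1)   # inverse empirical CDF: 3
by_type7 = quantile(x, 0.75)             # default interpolation: 3.25
print(by_index)
print(by_type1)
print(by_type7)
```

When the two sorted values around the 75% position are equal, as with data20 and data100 above, both methods return the same number.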
(g) Write the R code that calculates the probability of finding 3 typos in a 200-page manuscript.
dpois(3,6)

## [1] 0.08923508
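The rate of 6 used in part (g) comes from scaling λ = 3 typos per 100 pages to a 200-page manuscript. A minimal sketch of that scaling, checked against the Poisson probability formula, follows; the names lambda_100, lambda_200, and by_formula are illustrative.

```r
# Sketch: scale the Poisson rate to 200 pages, then apply the pmf formula.
lambda_100 = 3                                # typos per 100 pages
lambda_200 = lambda_100 * 200 / 100           # typos per 200 pages: 6
by_formula = exp(-lambda_200) * lambda_200^3 / factorial(3)  # e^(-6) * 6^3 / 3!
print(by_formula)
print(dpois(3, lambda_200))                   # should match the formula
```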
(h) Simulate the number of typos in 75 such 200-page manuscripts. Then calculate what proportion of the
75 generated values have 3 typos. Do not change the seed in the below code chunk from 123.
set.seed(123)
data = rpois(75,6)
length(which(data == 3))/75

## [1] 0.09333333
(i) Suppose a random sample of 5 authors each have 100-page manuscripts. Write the R code that calculates
the probability that 3 of the 5 authors each have 1 typo in their manuscripts.

prob = dpois(1,3)
dbinom(3,5,prob)

## [1] 0.02411037
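The two-step calculation in part (i) can be written out as follows, assuming the five manuscripts are independent and that the question asks for exactly 3 of the 5 authors. The symbols p and Y are introduced here for illustration.

```latex
% p: probability that a single 100-page manuscript has exactly 1 typo
p = P(X = 1) = \frac{e^{-3}\, 3^{1}}{1!} = 3e^{-3} \approx 0.1494

% Y: number of the 5 authors with exactly 1 typo, Y ~ Binomial(5, p)
P(Y = 3) = \binom{5}{3} p^{3} (1 - p)^{2} \approx 0.0241
```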
Question 2:
Suppose we have a normally distributed variable X with mean 100 and standard deviation 10.
(a) Generate a sample of 500 observations from this distribution and calculate how many of the 500
observations are between 105 and 115. Do not change the seed in the below code chunk from 123.
set.seed(123)
data = rnorm(500,100,10)
length(which(data < 115 & data > 105))

## [1] 112
(b) Calculate the probability that a randomly selected observation is between 105 and 115. Based on this
probability, calculate how many observations in a sample of 500 you expect to be between 105 and
115.
prob = pnorm(115,100,10)-pnorm(105,100,10)
prob

## [1] 0.2417303
prob*500

## [1] 120.8652
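The observed count of 112 in part (a) differs from the expected count of about 120.87. One way to judge whether that gap is unusual is to treat the count as binomial with n = 500 and the probability from part (b). This is a sketch under that assumption; the names sd_count, observed, and z are illustrative.

```r
# Sketch: compare the simulated count with its expected value on the binomial scale.
n = 500
prob = pnorm(115, 100, 10) - pnorm(105, 100, 10)  # P(105 < X < 115)
expected_count = n * prob                         # expected number in the interval
sd_count = sqrt(n * prob * (1 - prob))            # binomial standard deviation of the count
observed = 112                                    # count reported in part (a)
z = (observed - expected_count) / sd_count        # gap in standard deviations
print(c(expected = expected_count, sd = sd_count, z = z))
```

The gap is less than one standard deviation, so the simulated count is consistent with the theoretical probability.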
Question 3:
A student wrote the following code to sample from a discrete distribution with some probabilities for the
various values of X, using the inversion method of sampling:
u = runif(1000,0,1)
x = numeric(1000)
for(i in 1:1000){
  if(u[i] < 0.35){
    x[i] = -10
  } else if(u[i] < 0.60){
    x[i] = 0
  } else if(u[i] < 0.90){
    x[i] = 10
  } else{
    x[i] = 15
  }
}

Select a sample of the same size from a discrete distribution, using the same support and same probabilities
that were used above, this time using the sample() function. Store the sample in a vector, but do not print
the vector, as the output will be long.
data = sample(c(-10,0,10,15),1000,replace=TRUE,prob=c(0.35,0.25,0.30,0.10))
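The probabilities passed to sample() are the gaps between consecutive cutoffs in the student's if/else chain. The sketch below recovers them with diff() and also shows a vectorized form of the same inversion using findInterval(); the seed and the names cutoffs, support, and probs are assumptions made for illustration (the student's code sets no seed).

```r
# Sketch: recover the probabilities from the inversion cutoffs.
cutoffs = c(0.35, 0.60, 0.90)        # thresholds from the if/else chain
support = c(-10, 0, 10, 15)          # values of X, in the same order
probs = diff(c(0, cutoffs, 1))       # interval widths: 0.35, 0.25, 0.30, 0.10
print(probs)

# Vectorized inversion: findInterval() counts how many cutoffs are <= u.
set.seed(123)                        # illustrative seed (assumption)
u = runif(1000, 0, 1)
x = support[findInterval(u, cutoffs) + 1]
print(prop.table(table(x)))          # sample proportions, to compare with probs
```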

Question 4:
Suppose we take a random sample of size $n$ from a normal distribution with unknown mean $\mu$ and unknown
variance $\sigma^2$. Consider two different estimators of $\sigma^2$:

$$\hat{\sigma}_1^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2$$

and

$$\hat{\sigma}_2^2 = \frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2$$

Note that $\hat{\sigma}_1^2$ is the same as the well-known sample variance $s^2$, implemented in R with the var() function.
Let us explore why we prefer to use $\hat{\sigma}_1^2$ as an estimator of $\sigma^2$ rather than $\hat{\sigma}_2^2$.
Consider taking a sample of size 25 from the standard normal distribution (where we know the population
variance is 1):
rnorm(25,0,1)

Now repeat this process over and over again 1,000 times, and for each of these samples, calculate $s^2 = \hat{\sigma}_1^2$ and
$\hat{\sigma}_2^2$. (Note that $\hat{\sigma}_2^2 = \frac{n-1}{n} \hat{\sigma}_1^2$.) We then have 1,000 estimates of $\sigma^2 = 1$ using $s^2 = \hat{\sigma}_1^2$ and 1,000 estimates of
$\sigma^2 = 1$ using $\hat{\sigma}_2^2$. (Of course, if we already know $\sigma^2 = 1$, there would be no point in estimating it, but we are
trying to assess which of these two estimators performs better so we have to assume we know the value of
$\sigma^2$.) The code is provided in the below code chunk:
set.seed(123)
samples = vector("list",length=1000)
for(i in 1:1000){
  samples[[i]] = rnorm(25,0,1)
}
s2 = sapply(samples,var) # 1000 estimates using s^2
n = 25
sigma2squared_hat = (n-1)/n*s2 # 1000 estimates using the other estimator

Now find the average of the 1,000 estimates of $\sigma^2$ using $s^2 = \hat{\sigma}_1^2$ and the average of the 1,000 estimates of $\sigma^2$
using $\hat{\sigma}_2^2$.
mean(s2)

## [1] 1.00403
mean(sigma2squared_hat)

## [1] 0.963869
What do these results indicate about the performance of $s^2 = \hat{\sigma}_1^2$ and $\hat{\sigma}_2^2$ for estimating $\sigma^2$?
The average value of s^2 (1.00403) is about 9 times closer to sigma^2 = 1 than the average value of the other
estimator (0.963869), which underestimates sigma^2 on average. So it seems s^2 is a better estimator of sigma^2.
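The simulated averages line up with the theoretical expectations of the two estimators. A short derivation, using the standard result that the sample variance is unbiased for $\sigma^2$ and the relation between the two estimators stated earlier, with $n = 25$ and $\sigma^2 = 1$:

```latex
% The sample variance is unbiased:
E[\hat{\sigma}_1^2] = E[s^2] = \sigma^2 = 1

% The second estimator is a rescaled sample variance, so its expectation is rescaled too:
E[\hat{\sigma}_2^2] = \frac{n-1}{n}\, E[\hat{\sigma}_1^2] = \frac{n-1}{n}\, \sigma^2 = \frac{24}{25} = 0.96

% Its bias is therefore negative:
E[\hat{\sigma}_2^2] - \sigma^2 = -\frac{\sigma^2}{n} = -0.04
```

The averages 1.00403 and 0.963869 are close to 1 and 0.96 respectively, which is what these expectations suggest.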
