0% found this document useful (0 votes)

48 views

Lab 8 - Shell

This document describes a lab assignment on testing population proportions. The learning objectives are to test whether a population proportion is equal to a given value and to test whether two population proportions are equal. The document provides exercises to import datasets and use R functions like prop.test() to perform proportion tests. Hypotheses are stated and test decisions are made based on p-values and confidence intervals.

Uploaded by

Mansi

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

48 views

Lab 8 - Shell

Uploaded by

Mansi

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Lab 8 - Proportion Testing

Mansi Kumari (7908159)

2023-03-22

Learning Objectives

By the end of this lab, you should have a grasp on the following concepts:

• How to test whether a population proportion is equal to a given value.

• How to test whether two population proportions are equal to each other.

Instructions

To complete this worksheet, add code as needed into the R code chunks given below. Do not delete the
question text. All text should be in complete English sentences. Be sure to change the author of this file to
reflect your name and student number.
To properly see the questions, knit this .Rmd file to .pdf and view the output. You will have a link in your
email that takes you to the Crowdmark submission page. Once you have completed the worksheet, knit it
to .pdf and upload your output to Crowdmark.

1
Exercises
Import the React1000 dataset, which contains various measurements on a sample of 1000 Grade 12 students
across the United States, including their Region, Gender, Age, Handedness, Height (cm), Foot Length (cm),
and Armspan (cm).

React1000 <- read.csv("~/Downloads/LAB8/React1000.csv")

Use str to see the format of the dataset.

str(React1000)

## ’data.frame’: 1000 obs. of 7 variables:

## $ Region : chr "CA" "PA" "CO" "PA" ...
## $ Gender : chr "Female" "Female" "Female" "Male" ...
## $ Age : num 16 17 17 17 17 16 17 17 17 17 ...
## $ Handed : chr "Right-Handed" "Right-Handed" "Left-Handed" "Right-Handed" ...
## $ Height : int 164 163 157 175 175 177 163 179 151 186 ...
## $ Footlength: num 25 23 21 25 24 27 24 27.5 19 25 ...
## $ Armspan : num 165 158 156 182 180 ...

In 1992, a well-known study estimated that 11.1% of Americans aged 10 to 86 are left- or mixed-handed
(LMH). Suppose that we wish to test at the α = 0.01 level whether the proportion of Americans who are
LMH has changed since this estimate, using React1000 as our sample.
Give the hypotheses for this test.

H0 : p = 0.111 vs Ha : p ̸= 0.111

Use the table function to find the number of students in this sample who are LMH.

table(React1000$Handed)

##
## Ambidextrous Left-Handed Right-Handed
## 44 80 876

Calculate the test statistic for this test.

z.stat <- (0.124 - 0.111)/sqrt((0.111 * 0.889)/1000)

z.stat

## [1] 1.308673

Use pnorm to calculate the p-value for this test.

2 * pnorm(-z.stat)

## [1] 0.1906453

2
What is your decision regarding this test?
As the value of p is more than level of significance , we fail to reject H0.
Repeat the above test using the prop.test function.

prop.test(124, 1000, 0.111, alternative = "two.sided",

correct = FALSE)

##
## 1-sample proportions test without continuity correction
##
## data: 124 out of 1000, null probability 0.111
## X-squared = 1.7126, df = 1, p-value = 0.1906
## alternative hypothesis: true p is not equal to 0.111
## 95 percent confidence interval:
## 0.1050000 0.1458777
## sample estimates:
## p
## 0.124

Use the prop.test function to produce a 99% confidence interval for the true proportion of American citizens
who are LMH.

prop.test(124, 1000, 0.111, alternative = "two.sided",

conf.level = 0.99, correct = FALSE)

##
## 1-sample proportions test without continuity correction
##
## data: 124 out of 1000, null probability 0.111
## X-squared = 1.7126, df = 1, p-value = 0.1906
## alternative hypothesis: true p is not equal to 0.111
## 99 percent confidence interval:
## 0.09960635 0.15335021
## sample estimates:
## p
## 0.124

Exercise: Load in the Company500 dataset. This dataset contains various measurements on
a sample of 500 employees from a large company, including their age bracket (Age.Bracket:
either over or under 40), employment status (Status: either salaried or hourly), department,
and earnings bracket. Use the table function to obtain a count of how many employees are
hourly vs. salaried.

Company500 <- read.csv("~/Downloads/LAB8/Company500.csv")

table(Company500$Status)

##
## Hourly Salaried
## 351 149

3
Exercise: Perform a test at the 5% level of significance to determine whether the proportion
of employees who are salaried is below one-third, which is known to be the rate in a competing
company. Give whether you reject or fail to reject H0 .

prop.test(149, 500,1/3, alternative = "less",

correct = FALSE)

##
## 1-sample proportions test without continuity correction
##
## data: 149 out of 500, null probability 1/3
## X-squared = 2.809, df = 1, p-value = 0.04687
## alternative hypothesis: true p is less than 0.3333333
## 95 percent confidence interval:
## 0.000000 0.332659
## sample estimates:
## p
## 0.298

As the value of p is below the level of significance , we reject H0. We have sufficient evidence at the 5% level
of significance that the proportion of employees who are salaried are less than at the competing company.
Exercise: Calculate a 99% interval for the true proportion of employees who are salaried at
this company. Print out the confidence interval below.

prop.test(149, 500,1/3, alternative = "two.sided",conf.level = 0.99,

correct = FALSE)

##
## 1-sample proportions test without continuity correction
##
## data: 149 out of 500, null probability 1/3
## X-squared = 2.809, df = 1, p-value = 0.09374
## alternative hypothesis: true p is not equal to 0.3333333
## 99 percent confidence interval:
## 0.2482371 0.3530537
## sample estimates:
## p
## 0.298

c( 0.2482371, 0.3530537)

## [1] 0.2482371 0.3530537

The p value is above 1% level of significance so we fail to reject H0 and the confidence interval is [0.2482,
0.3531]
The 1992 study referenced earlier also found that the proportion of boys who are LMH is greater than
the proportion of girls who are LMH. Suppose that we wish to test this on our sample, at the 2% level of
significance.
Give the hypotheses for this test.

4
H0 : pM = pF vs Ha : pM > pF

Use the table function to compare counts of LMH by gender:

table(React1000$Gender,React1000$Handed)

##
## Ambidextrous Left-Handed Right-Handed
## Female 13 40 464
## Male 31 40 412

Use table to count the number of girls and boys in this sample:

table(React1000$Gender)

##
## Female Male
## 517 483

Use prop.test to conduct this test.

prop.test(c(71,53),c(483,517), alternative = "greater",

correct = FALSE)

##
## 2-sample test for equality of proportions without continuity correction
##
## data: c(71, 53) out of c(483, 517)
## X-squared = 4.5489, df = 1, p-value = 0.01647
## alternative hypothesis: greater
## 95 percent confidence interval:
## 0.01007626 1.00000000
## sample estimates:
## prop 1 prop 2
## 0.1469979 0.1025145

What is your decision regarding this test?

As the value of p is less than level of significance (0.02), we fail to reject H0.So there is sufficient evidence
to conclude that the proportion of boys who are LMH differs from the proportion of girls who are LMH
Use prop.test to create a confidence interval for the difference pM − pF .

prop.test(c(71,53),c(483,517), alternative = "two.sided",conf.level = 0.98,

correct = FALSE)

##
## 2-sample test for equality of proportions without continuity correction
##
## data: c(71, 53) out of c(483, 517)
## X-squared = 4.5489, df = 1, p-value = 0.03294

5
## alternative hypothesis: two.sided
## 98 percent confidence interval:
## -0.004179282 0.093146127
## sample estimates:
## prop 1 prop 2
## 0.1469979 0.1025145

Print out the confidence interval below.

confidence interval is [-0.0042, 0.0931]
Exercise: Using the Company500 dataset, use the table function to create a table comparing the
ages and employment status of the employees in this sample.

table(Company500$Age.Bracket,Company500$Status)

##
## Hourly Salaried
## Over 40 105 59
## Under 40 246 90

Exercise: Perform a test at the 1% level of significance to determine whether the proportion of
employees who are over 40 differs between the salaried and hourly workers. Mention whether
you reject or fail to reject H0 .

prop.test(c(105,59),c(351,149), alternative = "two.sided",conf.level = 0.99,

correct = FALSE)

##
## 2-sample test for equality of proportions without continuity correction
##
## data: c(105, 59) out of c(351, 149)
## X-squared = 4.4492, df = 1, p-value = 0.03492
## alternative hypothesis: two.sided
## 99 percent confidence interval:
## -0.21771465 0.02405894
## sample estimates:
## prop 1 prop 2
## 0.2991453 0.3959732

The p value is above 1% level of significance so we fail to reject H0.

Z - TEST and T Test
No ratings yet
Z - TEST and T Test
45 pages
ProbList5-24-Sln
No ratings yet
ProbList5-24-Sln
9 pages
One-Sample Test of Proportions: Z 1.733 One-Tailed Probability 0.042 Two-Tailed Probability 0.084
No ratings yet
One-Sample Test of Proportions: Z 1.733 One-Tailed Probability 0.042 Two-Tailed Probability 0.084
4 pages
Using R For Nonparametric Analysis
No ratings yet
Using R For Nonparametric Analysis
9 pages
Lab6_Hypothesis testing and confidence intervals in R
No ratings yet
Lab6_Hypothesis testing and confidence intervals in R
3 pages
Practical 8 PDF
No ratings yet
Practical 8 PDF
3 pages
lect_w7_f2023
No ratings yet
lect_w7_f2023
13 pages
R commands New 2
No ratings yet
R commands New 2
23 pages
Summary Review of Hypothesis
No ratings yet
Summary Review of Hypothesis
7 pages
Unit 2 Assignment SKELETON R spr18
No ratings yet
Unit 2 Assignment SKELETON R spr18
12 pages
Final Exam Practice Questions
No ratings yet
Final Exam Practice Questions
16 pages
BE186
No ratings yet
BE186
51 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
8 pages
Non Parametric Homework-9
No ratings yet
Non Parametric Homework-9
6 pages
WINSEM2015-16 CP1615 18-MAR-2016 RM01 Z-Test For Means and Proprtions
0% (2)
WINSEM2015-16 CP1615 18-MAR-2016 RM01 Z-Test For Means and Proprtions
7 pages
Stats 10 F21 Lab 5
No ratings yet
Stats 10 F21 Lab 5
6 pages
Discussion1 Solution
No ratings yet
Discussion1 Solution
5 pages
Statistics EXP-5
No ratings yet
Statistics EXP-5
10 pages
10-Sample Techniques - Two Sample
No ratings yet
10-Sample Techniques - Two Sample
7 pages
WINSEM2024-25_BMAT202L_TH_VL2024250501197_2025-03-03_Reference-Material-I
No ratings yet
WINSEM2024-25_BMAT202L_TH_VL2024250501197_2025-03-03_Reference-Material-I
65 pages
Modern Regression Homework 5-1
No ratings yet
Modern Regression Homework 5-1
8 pages
21bce0427 VL2022230503921 Ast04
No ratings yet
21bce0427 VL2022230503921 Ast04
13 pages
Unit 5.2 Testing Two Population Means
No ratings yet
Unit 5.2 Testing Two Population Means
24 pages
Student S T Statistic: Test For Equality of Two Means Test For Value of A Single Mean
No ratings yet
Student S T Statistic: Test For Equality of Two Means Test For Value of A Single Mean
35 pages
Z Test For Proportion
No ratings yet
Z Test For Proportion
29 pages
W3 - Testing Means - Choose Your Test
No ratings yet
W3 - Testing Means - Choose Your Test
7 pages
3 Proportion Test
No ratings yet
3 Proportion Test
15 pages
Physics ML
No ratings yet
Physics ML
10 pages
190 When Mu o Gives More Than Just The Mean
No ratings yet
190 When Mu o Gives More Than Just The Mean
12 pages
Chapter 10 Lecture-1
No ratings yet
Chapter 10 Lecture-1
13 pages
Lab 5 - Shell
No ratings yet
Lab 5 - Shell
7 pages
Introduction To Data Analysis Solutions
No ratings yet
Introduction To Data Analysis Solutions
5 pages
Test of Significance (Large Sample)
No ratings yet
Test of Significance (Large Sample)
21 pages
Chapter 2
No ratings yet
Chapter 2
41 pages
PS 3 Bus 310 Resubmit
No ratings yet
PS 3 Bus 310 Resubmit
7 pages
IntroStat 227 240
No ratings yet
IntroStat 227 240
14 pages
Module 4
No ratings yet
Module 4
99 pages
HW12 Sol
No ratings yet
HW12 Sol
9 pages
Assignment7 Solutions
No ratings yet
Assignment7 Solutions
7 pages
05-Hypothesis Testing T-Test (1) - 54
No ratings yet
05-Hypothesis Testing T-Test (1) - 54
56 pages
Mod_7_Study_Guide
No ratings yet
Mod_7_Study_Guide
3 pages
Lect 1 18
No ratings yet
Lect 1 18
22 pages
Final 221220 Statmeth Solutions
No ratings yet
Final 221220 Statmeth Solutions
4 pages
Week 4 Module in Stat 4th Quarter
No ratings yet
Week 4 Module in Stat 4th Quarter
13 pages
Which Test When: 1 Exploratory Tests
No ratings yet
Which Test When: 1 Exploratory Tests
5 pages
S Lab5 2507 W10
100% (1)
S Lab5 2507 W10
4 pages
Exp 7,8,9,10
No ratings yet
Exp 7,8,9,10
10 pages
MH3511 Data Analysis With Computer: Lab 5 (Solution) AY2019/20 Semester 2
No ratings yet
MH3511 Data Analysis With Computer: Lab 5 (Solution) AY2019/20 Semester 2
5 pages
2020 AP Statistics Exam: Formula Sheet
No ratings yet
2020 AP Statistics Exam: Formula Sheet
5 pages
Stat Unit 3 - T Test
No ratings yet
Stat Unit 3 - T Test
25 pages
Nu - Edu.kz Econometrics-I Assignment 7 Answer Key
No ratings yet
Nu - Edu.kz Econometrics-I Assignment 7 Answer Key
6 pages
Biometrics 2011 II 7
No ratings yet
Biometrics 2011 II 7
16 pages
Unit 5 Exam Review Answers
No ratings yet
Unit 5 Exam Review Answers
6 pages
UL3
No ratings yet
UL3
2 pages
Solutions Exercises Chapter 4 - Statistics For Engineers Exercise 1 A. 1. Model: The Starting Salaries (In K
No ratings yet
Solutions Exercises Chapter 4 - Statistics For Engineers Exercise 1 A. 1. Model: The Starting Salaries (In K
6 pages
MCS-011: Problem Solving and Programming
From Everand
MCS-011: Problem Solving and Programming
Dr. DK Sukhani
No ratings yet
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
From Everand
Sample Size for Analytical Surveys, Using a Pretest-Posttest-Comparison-Group Design
Joseph George Caldwell
No ratings yet
Fundamental Math
From Everand
Fundamental Math
Russell Pead
No ratings yet
Random Sample Consensus: Robust Estimation in Computer Vision
From Everand
Random Sample Consensus: Robust Estimation in Computer Vision
Fouad Sabry
No ratings yet
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet
Unit 5
No ratings yet
Unit 5
185 pages
Unit 4 2000
No ratings yet
Unit 4 2000
50 pages
Unit 3 2000
No ratings yet
Unit 3 2000
111 pages
Unit 2 2000
No ratings yet
Unit 2 2000
99 pages
Lab 7 - Shell
No ratings yet
Lab 7 - Shell
6 pages
Lab 6 - Shell
No ratings yet
Lab 6 - Shell
7 pages
Social Networking Pros and Cons
No ratings yet
Social Networking Pros and Cons
12 pages
Robust Detection of Multiple Outliers in A Multivariate Data Set
No ratings yet
Robust Detection of Multiple Outliers in A Multivariate Data Set
30 pages
Business Statistics SIM Semester 1 2019: Welcome - Lecture 1: Ms. Kathryn Bendell Email
No ratings yet
Business Statistics SIM Semester 1 2019: Welcome - Lecture 1: Ms. Kathryn Bendell Email
38 pages
Hypothesis Testing 1
No ratings yet
Hypothesis Testing 1
61 pages
What Methods Are Most Frequently Used in Research in Criminology and Criminal Justice?
No ratings yet
What Methods Are Most Frequently Used in Research in Criminology and Criminal Justice?
6 pages
Assignment 4
No ratings yet
Assignment 4
14 pages
UDAU M6 Correlation & Regression
No ratings yet
UDAU M6 Correlation & Regression
26 pages
Exercise Sheet Regression
No ratings yet
Exercise Sheet Regression
2 pages
Statap Practicetest 27
No ratings yet
Statap Practicetest 27
7 pages
CU-2022_B.Sc._(General)_Statistics_Semester-1_Paper-GE-1.1_Chg_QP
No ratings yet
CU-2022_B.Sc._(General)_Statistics_Semester-1_Paper-GE-1.1_Chg_QP
6 pages
Intermediate Regression With Statsmodels in Python
No ratings yet
Intermediate Regression With Statsmodels in Python
129 pages
Non-Parametric Tests
100% (1)
Non-Parametric Tests
55 pages
Demand Forecasting - Lecture Notes
100% (1)
Demand Forecasting - Lecture Notes
30 pages
R Project
No ratings yet
R Project
14 pages
18 Control Charts
100% (6)
18 Control Charts
2 pages
Outlier Sample, Tasneem Ahmad
No ratings yet
Outlier Sample, Tasneem Ahmad
9 pages
Statistical Tables and Formulae PDF
No ratings yet
Statistical Tables and Formulae PDF
93 pages
Section32 Measures of Central Tendency and Dispersion
No ratings yet
Section32 Measures of Central Tendency and Dispersion
19 pages
Section 03.4 Shared Lab
No ratings yet
Section 03.4 Shared Lab
5 pages
Unit-5 Anova
No ratings yet
Unit-5 Anova
12 pages
Pre Test
No ratings yet
Pre Test
9 pages
Ist 407 Presentation
No ratings yet
Ist 407 Presentation
12 pages
Sample in Variance and Standard Deviation
100% (1)
Sample in Variance and Standard Deviation
3 pages
Regression Logistic 4
No ratings yet
Regression Logistic 4
51 pages
Assignment 3
100% (1)
Assignment 3
5 pages
FINAL EXAM - Attempt Review PDF
No ratings yet
FINAL EXAM - Attempt Review PDF
9 pages
Index Crime and Non Index Crime
No ratings yet
Index Crime and Non Index Crime
18 pages
Pengaruh Kompensasi Terhadap Kinerja Karyawan: Abstract
No ratings yet
Pengaruh Kompensasi Terhadap Kinerja Karyawan: Abstract
9 pages
Histogram: Product Weight For A Sample of 40. Target Weight 50.0 Grams
No ratings yet
Histogram: Product Weight For A Sample of 40. Target Weight 50.0 Grams
2 pages
Cost
No ratings yet
Cost
6 pages