Lab 6 - Shell

This document describes a lab on performing ANOVA in R. The learning objectives are to learn how to perform ANOVA in R using both step-by-step methods and functions, and to perform investigations of the ANOVA model assumptions. The document contains exercises using a dataset of video game reviews to determine if different platforms have different average review scores, and using the iris dataset to determine if species have different average sepal lengths. The results of these analyses support rejecting the null hypotheses and concluding that platforms and species differ in their average scores/lengths.

Lab 6 - ANOVA 1

Mansi Kumari (7908159)

2023-03-03

Learning Objectives

By the end of this lab, you should have a grasp on the following concepts:

• How to perform ANOVA in R, both step-by-step and with an easy R function.

• How to perform a simple investigation of the model assumptions.

Instructions

To complete this worksheet, add code as needed into the R code chunks given below. Do not delete the
question text. All text should be in complete English sentences. Be sure to change the author of this file to
reflect your name and student number.
To properly see the questions, knit this .Rmd file to .pdf and view the output. You will have a link in your
email that takes you to the Crowdmark submission page. Once you have completed the worksheet, knit it
to .pdf and upload your output to Crowdmark.

Exercises
Import the Games200 dataset. This dataset contains a random sample of 200 games released in 2019, along
with the metascore (average critic review), the userscore (average user review), and platform of release.

Games200 <- read.csv("~/Downloads/Games200.csv")

Our goal is to determine whether each video game platform receives the same metascore on average, or not,
based on this sample.
Make a boxplot comparing the metascores for each platform.

boxplot(Metascore ~ Platform, data = Games200)

[Figure: boxplots of Metascore (y-axis) by Platform (x-axis: PC, PlayStation 4, Switch, Xbox One)]

Use aggregate to calculate the mean of each group.

aggregate(Metascore ~ Platform, data = Games200,FUN = mean)

## Platform Metascore
## 1 PC 74.63462
## 2 PlayStation 4 71.48889
## 3 Switch 72.24675
## 4 Xbox One 78.11538

Use aggregate to determine the sample size of each group.

aggregate(Metascore ~ Platform, data = Games200,FUN = length)

## Platform Metascore
## 1 PC 52
## 2 PlayStation 4 45
## 3 Switch 77
## 4 Xbox One 26

Calculate the overall mean.

mean(Games200$Metascore)

## [1] 73.46

Calculate the SSG by hand, using your earlier calculations.

my.SSG <- 52*(74.63-73.46)^2 + 45*(71.48-73.46)^2 + 77*(72.25-73.46)^2 + 26*(78.12-73.46)^2

my.SSG

## [1] 924.9421
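The hand calculation above uses group means rounded to two decimals. As a cross-check, the same sum can be sketched from the unrounded means and sample sizes printed by aggregate. The names n, group.means, grand.mean and SSG.check are illustrative and are not part of the worksheet.

```r
# Sample sizes and group means copied from the aggregate() output above
n <- c(PC = 52, PS4 = 45, Switch = 77, XboxOne = 26)
group.means <- c(74.63462, 71.48889, 72.24675, 78.11538)

# The overall mean is the size-weighted average of the group means
grand.mean <- sum(n * group.means) / sum(n)

# SSG: squared deviation of each group mean from the overall mean,
# weighted by the group's sample size
SSG.check <- sum(n * (group.means - grand.mean)^2)
SSG.check
```

The result is about 923.4, slightly below the 924.94 obtained by hand. The gap comes from rounding the means before squaring.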

Calculate the MSG by hand, using your earlier calculations.

my.MSG <- my.SSG/(4 - 1)

my.MSG

## [1] 308.314

Use the aggregate function with var to find the sample variances, and then from there find the SSE.

aggregate(Metascore ~ Platform, FUN = var, data = Games200)

## Platform Metascore
## 1 PC 58.78544
## 2 PlayStation 4 68.84646
## 3 Switch 57.42515
## 4 Xbox One 43.06615

my.SSE <- 51*58.79 + 44*68.85 + 76*57.43 + 25*43.07

my.SSE

## [1] 11469.12

Calculate the MSE by hand, using your earlier calculations.

my.MSE <- my.SSE/(200 - 4)

my.MSE

## [1] 58.51592
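The SSE weights each group's sample variance by its degrees of freedom (sample size minus one). A minimal sketch of that step, assuming the sample sizes and variances printed earlier; the names n, group.vars, SSE.check and MSE.check are illustrative only.

```r
# Sample sizes and sample variances copied from the aggregate() output above
n <- c(52, 45, 77, 26)
group.vars <- c(58.78544, 68.84646, 57.42515, 43.06615)

# SSE: each group's variance multiplied by its degrees of freedom (n - 1)
SSE.check <- sum((n - 1) * group.vars)

# MSE: SSE divided by (total sample size - number of groups) = 200 - 4
MSE.check <- SSE.check / (sum(n) - length(n))

SSE.check
MSE.check
```

Using the unrounded variances gives values close to the hand-computed 11469.12 and 58.52; the small differences come from rounding the variances to two decimals.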

Calculate the F test statistic, using your earlier calculations.

my.F <- my.MSG/my.MSE
my.F

## [1] 5.268892
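For reference, the quantities computed step by step so far can be written compactly. The notation is an assumption of this summary and does not appear in the worksheet: $k$ is the number of groups, $n_i$, $\bar{y}_i$ and $s_i^2$ are the size, mean and variance of group $i$, $\bar{y}$ is the overall mean, and $N$ is the total sample size.

$$
\begin{aligned}
\mathrm{SSG} &= \sum_{i=1}^{k} n_i\,(\bar{y}_i - \bar{y})^2, & \mathrm{MSG} &= \frac{\mathrm{SSG}}{k-1},\\
\mathrm{SSE} &= \sum_{i=1}^{k} (n_i - 1)\,s_i^2, & \mathrm{MSE} &= \frac{\mathrm{SSE}}{N-k},\\
F &= \frac{\mathrm{MSG}}{\mathrm{MSE}}.
\end{aligned}
$$

Here $k = 4$ and $N = 200$, so the F statistic has $k - 1 = 3$ and $N - k = 196$ degrees of freedom.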

Use pf to find the P-value for this test.

1 - pf(my.F, df1 = 3, df2 = 196)

## [1] 0.001622573
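An equivalent way to obtain the upper-tail probability is the lower.tail argument of pf. The sketch below also compares the F statistic with the 5% critical value from qf. The names F.stat, p.upper, p.complement and crit are illustrative, and F.stat is simply the value printed above.

```r
# F statistic from the step-by-step calculation above
F.stat <- 5.268892

# Upper-tail area requested directly from pf()
p.upper <- pf(F.stat, df1 = 3, df2 = 196, lower.tail = FALSE)

# The same area as the complement of the lower tail
p.complement <- 1 - pf(F.stat, df1 = 3, df2 = 196)

# Critical value for a test at the 5% level of significance
crit <- qf(0.95, df1 = 3, df2 = 196)

p.upper
F.stat > crit
```

Both forms give the same P-value here. Asking for the upper tail directly avoids the subtraction from 1, which only matters numerically when the P-value is extremely small.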

What is your conclusion?

The P-value is 0.00162, which is below 0.05, so we reject the null hypothesis at the 5% level of significance. We
have sufficient evidence to conclude that not all platforms have the same mean metascore.
Repeat the earlier test, using the aov function.

my.aov <- aov(Metascore ~ Platform, data = Games200)

Use the summary function to print out the ANOVA results.

summary(my.aov)

##              Df Sum Sq Mean Sq F value  Pr(>F)
## Platform      3    923  307.80   5.261 0.00164 **
## Residuals   196  11468   58.51
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Create a histogram of the residuals of the ANOVA model.

hist(my.aov$residuals)

[Figure: Histogram of my.aov$residuals (x-axis: my.aov$residuals, from -20 to 20; y-axis: Frequency)]

What does this tell you about your Normality assumption?

Use the aggregate function with sd to find the standard deviations of each group.

aggregate(Metascore ~ Platform, FUN = sd, data = Games200)

## Platform Metascore
## 1 PC 7.667167
## 2 PlayStation 4 8.297377
## 3 Switch 7.577939
## 4 Xbox One 6.562481

What does this tell you about your equal-variances assumption?
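One rule of thumb, the one applied to the iris data later in this lab, treats the equal-variances assumption as reasonable when the largest group standard deviation is less than twice the smallest. A minimal sketch of that check using the standard deviations printed above; the names group.sds and sd.ratio are illustrative.

```r
# Group standard deviations copied from the aggregate() output above
group.sds <- c(PC = 7.667167, "PlayStation 4" = 8.297377,
               Switch = 7.577939, "Xbox One" = 6.562481)

# Ratio of the largest to the smallest standard deviation
sd.ratio <- max(group.sds) / min(group.sds)

sd.ratio
sd.ratio < 2   # TRUE means the rule of thumb is met
```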

Next we will do ANOVA on the iris dataset. Use the data function to load in this dataset.

data(iris)

This dataset contains the petal and sepal lengths and widths (in cm) for a sample of 150 iris flowers. They
are divided by species: Iris setosa, Iris virginica, and Iris versicolor.
We will do an analysis to determine if their sepal lengths differ significantly, on average.
Exercise: Write the hypotheses for this test in TeX

$$H_0: \mu_{\text{Setosa}} = \mu_{\text{Virginica}} = \mu_{\text{Versicolor}} \quad \text{vs} \quad H_a: \text{Not all means are equal}$$

Exercise: Use the aov function to conduct a hypothesis test at the 5% level of significance to
determine whether the mean sepal lengths are equal for all species.

my_aov <-aov(Sepal.Length~Species,data = iris)

summary(my_aov)

##              Df Sum Sq Mean Sq F value Pr(>F)
## Species       2  63.21  31.606   119.3 <2e-16 ***
## Residuals   147  38.96   0.265
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
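The printed table rounds the P-value to <2e-16. If the exact values are needed for the written conclusion, they can be pulled out of the summary object. This is a sketch that refits the same model; the names aov.table, F.value and p.value are illustrative.

```r
data(iris)
my_aov <- aov(Sepal.Length ~ Species, data = iris)

# summary() on an aov fit returns a list; its first element is the ANOVA table
aov.table <- summary(my_aov)[[1]]

# Row 1 is the Species row, row 2 is the Residuals row
F.value <- aov.table[["F value"]][1]
p.value <- aov.table[["Pr(>F)"]][1]

F.value
p.value
p.value < 0.05   # TRUE means reject the null hypothesis at the 5% level
```

Rows are indexed by position here because the row names of the table are padded with trailing spaces.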

Exercise: Give a fully-worded conclusion to this test.

The P-value is less than 2e-16, which is below 0.05, so we reject the null hypothesis. There is sufficient
evidence at the 5% level of significance to conclude that the mean sepal lengths are not equal for all
species.
Exercise: Check whether the ANOVA model assumptions appear to be accurate.

hist(my_aov$residuals)

[Figure: Histogram of my_aov$residuals (x-axis: my_aov$residuals, from -2.0 to 1.5; y-axis: Frequency)]

aggregate(Sepal.Length~Species,data = iris,FUN = sd)

## Species Sepal.Length
## 1 setosa 0.3524897
## 2 versicolor 0.5161711
## 3 virginica 0.6358796

The residuals appear to have an approximately normal shape, and the largest standard deviation (0.636) is less
than twice the smallest (0.352), so the conditions of the test appear to be satisfied.
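It can help to see what the histogram is actually plotting. In a one-way ANOVA, each residual is an observation minus the mean of its own group. The sketch below rebuilds the residuals that way and compares them with the ones stored in the fitted model; the names group.mean and manual.resid are illustrative.

```r
data(iris)
my_aov <- aov(Sepal.Length ~ Species, data = iris)

# ave() returns, for every flower, the mean sepal length of its species
group.mean <- ave(iris$Sepal.Length, iris$Species)

# Residual = observation minus its own group mean
manual.resid <- iris$Sepal.Length - group.mean

# Compare with the residuals stored in the model (names dropped first)
all.equal(unname(residuals(my_aov)), manual.resid)

# The sum of the squared residuals is the SSE from the ANOVA table
sum(manual.resid^2)
```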
