Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
46 views

Assignment 2

This document provides the details of an assignment for a regression analysis course. Students are asked to perform several regression analyses using an IPL player dataset to explore relationships between player attributes like batting average and sold price. They must submit their analysis and R code in Google Classroom. The assignment includes 5 questions involving simple and multiple linear regression models to examine how factors like age, batting average, and sixes influence a player's sold price.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views

Assignment 2

This document provides the details of an assignment for a regression analysis course. Students are asked to perform several regression analyses using an IPL player dataset to explore relationships between player attributes like batting average and sold price. They must submit their analysis and R code in Google Classroom. The assignment includes 5 questions involving simple and multiple linear regression models to examine how factors like age, batting average, and sixes influence a player's sold price.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Indian Institute of Management

Tiruchirappalli ​PGP Term IV BSF (2020) -


Assignment 2 (Regression)

Submission Date: Google

Classroom ​Maximum Marks: 10

This is ​a ​group a
​ ​ssignment. You need to use R progr​a​mming for a
​ ​ll da
​ ​t​a

cle​a​ning, ​ma
​ ​nipula
​ ​tion, ​a​nd ​a​na
​ ​lysis. Submit the soft copy of your ​a​ssignment, R

codes, ​a​nd other supporting documents in the Google Cl​a​ssroom (No h​a​rd

copy/em​a​il). Show ​a​dequ​a​te re​a​soning, including ca


​ ​lcula
​ ​tions, if a
​ ​ny, in support

of your ​a​nswers. St​a​te ​a​ny ​a​ssumption you m​a​ke (your ​a​ssumptions need to be

​ ​ble). Mention
re​a​sona ​ your Group No (1 or 2) and team No (1 to 10) in your

submission file-name ​(​Assignment1_GR1_TEAM5.R​). Also, mention your team

details in one of your uploading files. The


​ following questions are based on

IMB381IPL2013.xlsx​. Use ​readxl ​r-package to load ​the xlsx file (or you can

convert the xlsx to csv). You can find the details about the data set from the

textbook ("Business Analytics: The Science of Data - Driven Decision Making")

Page number 314-320.

1. Prepare a summary or visualization to answer the following question. "Cricket

in the ​T20 format is considered a young man’s sport, is there evidence that the

player’s price is influenced by age?". Comment (answer could be subjective). ​[1


Mark]
2. Develop a simple linear regression model using sold price as the response

variable and ​batting average as the explanatory variable. Comment on the

model, is there any evidence to suggest that the batting average has a

statistically significant relationship with sold price. ​[1 Mark]


3. Develop a multiple linear regression model between sold price and the

batting average ​and number of sixers. Compare this model with the model in

question 2. What conclusions can you reach based on both these models. ​[2

Marks]
4. Give the output of the regression model which you think the best. ​Brief ​the

steps you ​have followed. What is the Mallows’s Cp of the model? Identify the

player who is highly overpaid and the player who is highly underpaid. ​[3 Marks]
5. Comment on the following
questions

(a) Is the data sufficient to develop the regression model to explain the

variation in the ​price of IPL players. (b)


​ What other variables (which is not in
the data set)could improve the model? (To ​get the full marks your answer

should be practical and least expensive to get the extra information) (c)

Which of the following you think the best dependent variable in terms of

satisfying ​the regression assumptions: 1) Sold Price 2) Log(Sold Price) 3)

Sold Price - Base Price 4) log(Sold Price/Base Price). Why?


[3 Marks]

You might also like