
IPL PA-nik


Q-1 Based on the data set, prepare a report using appropriate exploratory data analysis tools
and techniques. The report must include all necessary charts and tables for both univariate and
multivariate analysis, with a clear interpretation in each case.
Univariate Analysis (IPL_BATS)
summary(e)

boxplot(e$Runs_Scored)
The range for runs scored is skewed on the lower side, with 1 outlier on the upper side. The median runs scored is 2000.

boxplot(e$Strike_Rate)
The range for strike rate is narrower, with 4 outliers on the lower side and 2 on the upper side. The median strike rate is nearly 140.
boxplot(e$Bat_Average)
The range for batting average is skewed on the lower side, with 1 outlier on the upper side. The median batting average is 25.

boxplot(e$Salary)
The range for salary is wider and skewed on the lower side, with 2 outliers on the upper side. The median salary is nearly 5 lakhs.
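The outlier counts and medians read off the boxplots can also be computed directly. The sketch below is only an illustration: the data frame `e` here is a synthetic stand-in with made-up values (the assignment's data set is not reproduced), and `describe_box` is a hypothetical helper name. It relies on `boxplot.stats()`, which applies the same 1.5 * IQR whisker rule that `boxplot()` uses by default.

```r
# Synthetic stand-in for the assignment's data frame `e` (values are made up).
set.seed(1)
e <- data.frame(
  Runs_Scored = round(rexp(50, rate = 1 / 1500)),
  Strike_Rate = rnorm(50, mean = 135, sd = 12)
)

# Hypothetical helper: median plus outlier counts on each side of the whiskers.
describe_box <- function(x) {
  s <- boxplot.stats(x)  # s$stats = lower whisker, hinge, median, hinge, upper whisker
  c(median        = s$stats[3],
    low_outliers  = sum(s$out < s$stats[1]),
    high_outliers = sum(s$out > s$stats[5]))
}

# One column of results per variable.
box_summary <- sapply(e, describe_box)
print(box_summary)
```

On the real data, replacing the synthetic `e` with the loaded data set would give the figures quoted in the interpretations above.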
Correlation between Salary and Hundred: significant, as p < 0.05
Correlation between Salary and Bat Average: significant, as p < 0.05
Correlation between Salary and Strike Rate: significant, as p < 0.05
Correlation between Salary and Runs scored: significant, as p < 0.05
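The source does not show which call produced these p-values. One common way to obtain a correlation and its significance in R is `cor.test()`, sketched below on synthetic data. The values, the units of `Salary`, and the strength of the relationship are all assumptions made for illustration only.

```r
# Synthetic stand-in for `e`; the relationship below is invented for the demo.
set.seed(2)
n <- 60
e <- data.frame(Runs_Scored = round(runif(n, 100, 4000)))
e$Salary <- 0.002 * e$Runs_Scored + rnorm(n, sd = 1)  # hypothetical units

# Pearson correlation test (the default method of cor.test).
ct <- cor.test(e$Salary, e$Runs_Scored)

print(ct$estimate)        # sample correlation coefficient r
print(ct$p.value)         # p-value for H0: true correlation is 0
print(ct$p.value < 0.05)  # TRUE means significant at the 5% level
```

The same call can be repeated for each of the four salary pairs listed above.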
Q-2 Based on the predictive model for Salary, interpret the model output.

relation <- lm(Salary ~ Strike_Rate, data = e)
summary(relation)
Strike rate has a p-value < 0.05 and is a significant variable for predicting salary.

relations <- lm(Salary ~ Strike_Rate + Runs_Scored + Fifties + Sixes + Fours + Bat_Average, data = e)
summary(relations)
Strike rate, runs scored, sixes and fours have p-values < 0.05 and are significant.
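Rather than reading significance off the printed summary, the coefficient table can be pulled out and filtered. This is a minimal sketch on synthetic data with only two predictors. The data values and effect sizes are assumptions, and the object names `fit`, `coefs` and `p_values` are illustrative, not from the assignment.

```r
# Synthetic stand-in for `e`; coefficients below are invented for the demo.
set.seed(3)
n <- 80
e <- data.frame(
  Strike_Rate = rnorm(n, mean = 135, sd = 12),
  Runs_Scored = round(runif(n, 100, 4000))
)
e$Salary <- 0.05 * e$Strike_Rate + 0.002 * e$Runs_Scored + rnorm(n)

fit <- lm(Salary ~ Strike_Rate + Runs_Scored, data = e)

# Coefficient matrix with columns: Estimate, Std. Error, t value, Pr(>|t|).
coefs <- coef(summary(fit))
p_values <- coefs[-1, "Pr(>|t|)"]  # drop the intercept row

# Predictors significant at the 5% level.
print(names(p_values)[p_values < 0.05])

# Adjusted R-squared, one of the fit statistics reported by summary().
print(summary(fit)$adj.r.squared)
```

Applied to the full model `relations`, the same filter would list the predictors with p < 0.05.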
Q-3 Is the model performance satisfactory? Will it be possible to use such a model in deciding the
salary of IPL players? Discuss in detail.

Q-4 How will you improve the performance of such a model? Discuss in detail.
