Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
0% found this document useful (0 votes)
11 views

Module 4

big data analytics
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views

Module 4

big data analytics
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 4

Module 4

1. Consider a table shows the advertisement made by the company in the last 5 years and the
corresponding sales:
Advertisement Sales
$90 $1000
$120 $1300
$150 $1800
$100 $1200
$130 $1389
$200 $2318

Apply linear regression technique to help the company predict the sales for $200
advertisement in the year 2019.
2. Experiment Z-Score and InterQuartileRange methods to determine outliers in a dataset
with example programs.
3.If probability density function is given as
X 1 2 3 4 5 6 7
P(X=x) K 3K 5K 7K 9K 11K 13K

Then find
i) K ii)Probability distribution iii)Mean iv)Variance v)STD vi)P(x>5) vii)P(X<=5)
Sum of probability=1
K+3K+5K+7K+9K+11K+13K=1
K=1/49
ii)P(1)=1/49 P(2)=3/49 P(3)=5k=5/49
iii)Mean=1/49+2*3/49+3*5/49+4*7/49+5*9/49+6*11/49+7*13/49
iv)Variance====sum of x^2*P(x)-mean^2
STD=sqrt(variance)
P(x>5)=P(x=6)+P(7)
P(X<=5)=P(5)+P(4)+P(3)+P(2)+P(1)

4. For the following example discuss the different steps of forming association rules using
Apriori algorithm. Assume Support = 33% & Confidence = 50%
Sl. No Transaction List
1 Milk Egg Bread Butter
2 Milk Butter Egg Ketchup
3 Bread Butter Ketchup
4 Milk Bread Butter
5 Bread Butter Cookies
6 Milk Bread Butter Cookies
7 Milk Cookies
8 Milk Bread Butter
9 Bread Butter Egg Cookies
10 Milk Butter Bread
11 Milk Bread Butter
12 Milk Bread Cookies Ketchup

5. Determine how does Apriori algorithm Work? Apply same for the following example.
Assume the S=2 and Confidence = 50%
Tid List of Item ID’s
T100 I1 I2 I5
T200 I2 I4
T300 I2 I3
T400 I1 I2 I4
T500 I1 I3
T600 I2 I3
T700 I1 I3
T800 I1 I2 I3 I5
T900 I1 I2 I3
6. Describe the regression analysis predict the value of the dependent variable in case of
linear regression.
7. Implement the SVM classifier which classifies the input dataset on the basis of
transparency of the colors.
8. Describe different similarity measurement methods.
Linear Regression-----
Y=mx+b
Y---dependent m x n b
X is independent variable
m----slope ie changes in y value as unit change of x value
b----value of Y when x=0
Predict pizza price by using its size
X=Pi Y=Pi Me Me Devia Devia Product of Sum Square of sum
zza zza an an tion tion deviation= of X of
Size price of of of of DX*DY prod deviation= squar
in in X Y X=xi- Y=yi- uct of DX^2 e of X
Diam RS mean mean devia devia
eter tion tion
8 100 11 126. -3 -26.66 79.98 169.9 9 26
10 130 66 -1 3.33 -3.33 7 1
15 150 4 23.33 93.32 16

20 185.4
3

m=sum of product of deviation/sum of square of X deviation=169.9/26=6.53


b=Y Mean-(m*X mean)=126.66-(6.53*11)=54.83
Y=mx+b=6.53*20+54.83==185.43

You might also like