BDA Module 3
BDA Module 3
BDA Module 3
2 2
x
2
ty t y
For
X=(y23-1)
y=(2
Y.y CIx2)(2xD +Ex)
22-)
lx(=t4t
tyt=
PAGE N
DATE
Eclidian discence
=
d(xy=0
A Sqtiihy Symmetic poperty beca e
CX-y)2 =(yi-x)2
For exU ( 2 ) and (64)
(2-6 (47-4) 2
4)2t(3)2
JlG+ 9
J25
5
Jaccar Disane
y=i 5,8)7 lo
=4
2
3
The distan Ce becomesense when po) ntS ee
sting the distanle oet 2 stngs
x abc de
y=actdegds
2 De ete b
I n s e i t tsalte
Edit distance =3
Hamming d:stance
2
=3 CoS
3
JG
J4tL+
y=)+()w for
+L JIt4 J
+(2)+EY? forx L2nom
)+(2X1+E)x]
ofbothecors
m NoL2
fvectors poduct Dot Cos=
m ndatlon Recornme tin PliCa LL2- Y=
uers. ice andsimilaity
te is mof anglR the LesS Distan
e aCosine
eeto bet) angte iS Tt
Sums:. BDA
DATE
PAGE
NG
PAGE No.
DATE
A={L2,3
6={i24,5
AUB
2
5
5
. only
o Contain cohich is
os boolean
ue for wsed DstRnce
- Hamming 4
OATE
2
N(a-2)+c4-72
2.
4+2)
Nl6+9
J25
5
Alsownte exane br
or all o boul re
all gb
Teacher's Signature:...
(PAGE No.
DATE
N s 20
m ouceets:.
2
4 4
2 2
OHb Sco e
O Auehority Scoe
fage Ronk
is the functien that assigrsa ealnumberto each
page ín t e e
Teacher's Signaturei..
DATE
hsert
lter e
-. Cdit ditance 3
Teacher's Signature:.
Map Tasks
Achunk is a collection of elements and no element is stored across twochunks.
Technically, all inputs to Map tasks and outputs from Reduce tasks are of the key-valuc-pair
form.
The Map function takes an input clement as its argument and produces zero or morckey-valuc
pairs.
Grouping by Key
As soon as the Map tasks have all completed successfully,the key-value pairs are grouped by
key and the values associated with each key are formed intoa list of values.
2| Page
ENG 15 AM
O Type here to search BSE smlcap -5.27% ^G 4)
G O File D:/BDA%20Practs/BDA620Notes620 By%20Santosh620 Tamboli%20Sir.pdf
3 of 33
The user typically tells the MapReduce system what r should be. Then the master controller
picksa hash function that applies to keys and produces a bucket number from 0 to r- 1.
M
Reduce Tasks
The Reduce function's argument is a pair consistingof a key and list of associated values.
The output of the Reduce function is a sequence of zero or more key-value pairs.
These key-value pairs can be of a type different from those sent from Map tasks to Reduce tasks,
but often they are the same type.
A Reduce task receives one or more keys and their associated value lists.
Reduce task executes one or more reducers. The outputs from all the Reduce tasks are merged
into a single file.
Combiners
These key-value pairs would thus be replaced by one pair with key w and value ccgal to the sum
of all the l's in those pairs.
That is, the pairs with key w generated by a single Map task
would be replaced by a pair (w,m), where m is the number of times thatw appears.
Type here tosearch ENG 923 AM
29C Smoke G 40
Q. What is PageRank?
PageRank is a function that assigns, areal number to each page in the Web. M
The intent is that the higher the PageRank of a page, the more
"important" it is.
9|Page
There is not one fixed algorithm for assignment of PageRank and variations on the basic idea can
alter the relative PageRank of any two pages.
Web can be represented as a directed graph, where pages are the nodes, and there is an arc from
ENG A1 AM
O Type here to search 29°C Smoke ^G ) IN
10 of 33
C.g.
N= 244winda Size)
(4). (2)
TosestmpsR7 92 95
92 9
92
42 8S 98 lo2
|o|011ooooloL|oo1o|
8592. 98
o101oool ot u|o|oooi|
To oolol|ooo1o1l| oo
8592 98 lo2