CSD311: Artificial Intelligence
CSD311: Artificial Intelligence
CSD311: Artificial Intelligence
Neural networks I
xd 1(bias)
wdj
w0j
xi wij Pd
j fj ( i=1 wij xi + w0j )
w1j Pd
x1 i=1 wij xi + w0j
I The output from a unit j is a non-linear function fj (called the
activation function) of the input. That is output of unit j, yj
is:
Xd
yj = fj (netj ) = fj ( wij xi + w0j )
i=1
xd−1 → · · K → zK
wjK
xi → i j k → zk
w1K
x2 → · · 1 → z1
x1 → 1 w11
1
K
1X
E(w) = (tk − zk )2
2
k=0
∂E ∂E ∂yj ∂netj
=
∂wij ∂yj ∂netj ∂wij
∂netj ∂yj
I ∂wij = xi and ∂netj = fj0 (netj )
PK
∂E ∂[ 1 k=0 (tk − zk )2 ]
= 2
∂yj ∂yj
K
X ∂zk
=− (tk − zk )
∂yj
k=0
K
X ∂zk ∂netk
=− (tk − zk )
∂netk ∂yj
k=0
XK
=− (tk − zk )fk0 (netk )wjk
k=0
So,
K
X
∆wij = ηfj0 (netj )xi (tk − zk )fk0 (netk )wjk
k=0
Training algorithm for 3NN