8 - Knowledge in Learning
Week 8
Knowledge in Learning
LEARNING OUTCOMES
But
1. Checking all previous instances over again is expensive.
2. Good heuristics are difficult to find, and backtracking is slow because the
hypothesis space is doubly exponential in size.
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search
Least commitment:
Instead of keeping around one hypothesis and using backtracking, keep
all consistent hypotheses (and only those).
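The least-commitment idea can be illustrated with a minimal sketch. It assumes a hypothetical toy setting that is not part of the slides: examples have two Boolean attributes, and a hypothesis is any Boolean function of them, stored as a truth table. All names (`INPUTS`, `HYPOTHESES`, `update_version_space`) are illustrative. Brute-force enumeration is only feasible here because the space is tiny; with n attributes it has 2^(2^n) members.

```python
from itertools import product

# Hypothetical toy setup: an example has two Boolean attributes, and a
# hypothesis is a truth table giving one output bit per possible input.
INPUTS = list(product([False, True], repeat=2))                 # 4 inputs
HYPOTHESES = list(product([False, True], repeat=len(INPUTS)))   # 2^(2^2) = 16

def predict(hypothesis, x):
    """Look up the output of a hypothesis for input x in its truth table."""
    return hypothesis[INPUTS.index(x)]

def update_version_space(version_space, example):
    """Keep only the hypotheses that agree with the new example."""
    x, label = example
    return [h for h in version_space if predict(h, x) == label]

# Start from every hypothesis and shrink the set one example at a time.
# Earlier examples are never re-checked and nothing is backtracked.
version_space = HYPOTHESES
examples = [
    ((True, True), True),
    ((False, False), False),
    ((True, False), True),
]
sizes = [len(version_space)]
for ex in examples:
    version_space = update_version_space(version_space, ex)
    sizes.append(len(version_space))

print(sizes)  # [16, 8, 4, 2]
```

Each example fixes one entry of the truth table, so the set of consistent hypotheses halves with every new distinct input.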
Decision-Tree-Learning
o Grandparent(⟨Mum, Charles⟩) . . .
o FirstElementIsMotherOfElizabeth(⟨Mum, Charles⟩) .
The reader will certainly have noticed that a little bit of background
knowledge would help in the representation of the Grandparent
definition. For example, if Background included the sentence
Parent(x, y) ⇔ [Mother(x, y) ∨ Father(x, y)],
then the definition of Grandparent would be reduced to
Grandparent(x, y) ⇔ [∃z Parent(x, z) ∧ Parent(z, y)].
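As a rough illustration of how the background sentence shortens the definition, the two formulas can be evaluated over a small set of facts. The facts below are assumed purely for this sketch and are not given in the slides; relations are stored as sets of (x, y) pairs, where x is the mother, father, or parent of y.

```python
# Illustrative facts only (assumed for this sketch, not given in the text).
mother = {("Mum", "Elizabeth"), ("Elizabeth", "Charles")}
father = {("George", "Elizabeth"), ("Philip", "Charles")}

# Background knowledge: Parent(x, y) <=> Mother(x, y) or Father(x, y)
parent = mother | father

# Grandparent(x, y) <=> there exists z with Parent(x, z) and Parent(z, y)
grandparent = {
    (x, y)
    for (x, z1) in parent
    for (z2, y) in parent
    if z1 == z2
}

print(sorted(grandparent))  # [('George', 'Charles'), ('Mum', 'Charles')]
```

Without the `parent` relation, the same definition would need a separate case for each combination of `mother` and `father`, four in total.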
INDUCTIVE LOGIC PROGRAMMING
The agent’s policy π is fixed: in state s, it always executes the action π(s).
Its goal is simply to learn how good the policy is, that is, to learn the utility
function U^π(s).
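One simple way to estimate U^π(s) from experience is direct utility estimation: run trials under the fixed policy and average the observed reward-to-go for each state. The sketch below assumes this technique; the slides do not say which method is intended. The trial format, the state names, and the function `direct_utility_estimate` are hypothetical.

```python
from collections import defaultdict

def direct_utility_estimate(trials, gamma=1.0):
    """Estimate U^pi(s) as the average observed reward-to-go from s.

    Each trial is a list of (state, reward) pairs recorded while the
    agent follows the fixed policy pi until the trial ends.
    """
    totals = defaultdict(float)
    counts = defaultdict(int)
    for trial in trials:
        reward_to_go = 0.0
        # Walk backwards so the reward-to-go accumulates in one pass.
        for state, reward in reversed(trial):
            reward_to_go = reward + gamma * reward_to_go
            totals[state] += reward_to_go
            counts[state] += 1
    return {s: totals[s] / counts[s] for s in totals}

# Hypothetical trials: the states and rewards are made up for illustration.
trials = [
    [("A", -1.0), ("B", -1.0), ("goal", 10.0)],
    [("A", -1.0), ("goal", 10.0)],
]
utilities = direct_utility_estimate(trials)
print(utilities["A"], utilities["B"], utilities["goal"])  # 8.5 9.0 10.0
```

The policy itself never appears in the code: the agent only observes the states and rewards that result from following it, which is what makes the setting passive.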
PASSIVE REINFORCEMENT LEARNING