0% found this document useful (0 votes)

361 views

8 - Knowledge in Learning

The document discusses various techniques for artificial intelligence, including: 1. Explanation-based learning which constructs general rules from individual examples by creating proofs. 2. Inductive logic programming which induces general first-order theories from examples by representing hypotheses as logic programs. 3. Reinforcement learning where an agent learns optimal actions through trial-and-error interactions in an environment to achieve its goals, either passively by observing or actively by acting.

Uploaded by

Elsa Mutiara

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

361 views

8 - Knowledge in Learning

Uploaded by

Elsa Mutiara

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 35

Artificial Intelligence

Week 8
Knowledge in Learning
LEARNING OUTCOMES

At the end of this session, students will be able to:

LO2 Explain how to use knowledge representation in reasoning purpose
LO3 Apply various techniques to an agent when acting under certainty
OUTLINE

1. A Logical Formulation of Learning

2. Knowledge in Learning
3. Explanation Based Learning
4. Inductive Logic Programming
5. Passive and Active Reinforcement Learning
6. Generalization in Reinforcement Learning
7. Application of Reinforcement Learning
8. Summary
A LOGICAL FORMULATION OF LEARNING
o Study learning methods that can take advantage of prior knowledge
about the world. In most cases, the prior knowledge is represented
as general first-order logical theories; thus, for the first time we
bring together the work on knowledge representation and learning.
o The logical formulation of learning may seem like a lot of extra work
at first, but it turns out to clarify many of the issues in learning.
A LOGICAL FORMULATION OF LEARNING
• The hypothesis is represented by a set of logical sentences
• Example descriptions and classifications will also be logical
sentences.
• A new example can be classified by inferring a classification sentence
from the hypothesis and the example description
A LOGICAL FORMULATION OF LEARNING
o Goal and Hypotheses:
Goal predicate Q: WillWait
o Learning: to find an equivalent logical expression we can classify
examples
o Each hypotheses proposes such an expression
a candidate definition of Q:
A LOGICAL FORMULATION OF LEARNING
 An example: an object of some logical description to which the goal
concept may or may not apply

 The classification of the examples

 Each hypothesis hj have the form

where Cj (x) is a candidate definition

A LOGICAL FORMULATION OF LEARNING
 The relation between f and h are: ++, --, +- (false negative), -+(false
positive)
 An example can be a false negative for the hypothesis, if the hypothesis
says it should be negative but in fact it is positive.

would be a false negative for the hypothesis hr

A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search
(extensions of predictor Hr)

Initial False False

hypothesis negative a generalization positive a specialization

Generalization e.g. via dropping conditions

Alternate(x)Patrons(x, Some)  Patrons(x, Some)
Specialization e.g. via adding conditions or via removing disjuncts
Alternate(x)Patrons(x, Some)  Patrons(x, Some)
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search

But
1. Checking all previous instances over again is expensive.
2. Difficult to find good heuristics, and backtracking is slow in the
hypothesis space (which is doubly exponential)
A LOGICAL FORMULATION OF LEARNING
Current-best-hypothesis search
Least commitment:
Instead of keeping around one hypothesis and using backtracking, keep
all consistent hypotheses (and only those).

Incremental: old instances do not have to be rechecked

KNOWLEDGE IN LEARNING

o The preceding section described the simplest setting for inductive

learning. To understand the role of prior knowledge, we need to
talk about the logical relationships among hypotheses, example
descriptions, and classifications.
o Let Descriptions denote the conjunction of all the example
descriptions in the training set, and let Classifications denote the
conjunction of all the example Classifications. Then a Hypothesis
that "explains the observations" must satisfy the following property
(recall that |= means "logically entails"):

Hypothesis ۸ Descriptions |= Classifications

EXPLANATION BASED LEARNING
o Explanation-based learning is a method for extracting general rules from
individual observations.
o The technique of memoization has long been used in computer science
to speed up programs by saving the results of computation. The basic
idea of memo functions is to accumulate a database of input—output
pairs; when the function is called, it first checks the database to see
whether it can avoid solving the problem from scratch.
o Explanation –based learning takes this a good deal further, by creating
general rules that cover an entire class of cases.
EXPLANATION BASED LEARNING

Basic EBL process works as follows

o Given an example, construct a proof that the goal predicate applies to
the example using the available background knowledge.
o In parallel, construct a generalized proof tree for the variabilized goal
using the same inference steps as in the original proof.
o Construct a new rule whose left-hand side consists of the leaves of the
proof tree and whose right-hand side is the variabilized goal (after
applying the necessary bindings from the generalized proof).
o Drop any conditions from the left-hand side that are true regardless of
the values of the variables in the goal.
LEARNING AND USING RELEVANCE
INFORMATION
o The learning algorithm we now present is based on a straightforward
attempt to find the simplest determination consistent with the
observations.
o A determination is therefore consistent with a set of examples if every
pair that matches on the predicates on the left-hand side also matches
on the goal predicate.
LEARNING AND USING RELEVANCE
INFORMATION
INDUCTIVE LOGIC PROGRAMMING
o Inductive logic programming (ILP) combines inductive methods with
the power of first-order representations, concentrating in particular on
the representation of hypotheses as logic programs.

o It has gained popularity for three reasons :

1. ILP offers a rigorous approach to the general knowledge-based
inductive learning problem.
2. ILP offers complete algorithms for inducing general, first-order
theories from examples, which can therefore learn successfully in
domains where attribute-based algorithms are hard to apply.
3. Inductive logic programming produces hypotheses that are
(relatively) easy for humans to read.
INDUCTIVE LOGIC PROGRAMMING
o the general knowledge-based
induction problem is to “solve”
the entailment constraint for
the unknown Hypothesis, given
the Background knowledge and
examples described by
Descriptions and
Classifications .
o The descriptions will consist of
an extended family tree,
described in terms of Mother ,
Father , and Married relations
and Male and Female
properties.
INDUCTIVE LOGIC PROGRAMMING
o The sentences in Classifications depend on the target concept being
learned.
o For example: Grandparent, BrotherInLaw, or Ancestor
o The complete set of Grandparent classifications contains 20 × 20 = 400
conjuncts of the form
INDUCTIVE LOGIC PROGRAMMING
Hypothesis
INDUCTIVE LOGIC PROGRAMMING

Decision-Tree-Learning
o Grandparent (⟨Mum , Charles ⟩) . . .
o FirstElementIsMotherOfElizabeth(⟨Mum,Charles⟩) .

The reader will certainly have noticed that a little bit of background
knowledge would help in the representation of the Grandparent
definition. For example, if Background included the sentence
Parent(x,y) ⇔ [Mother(x,y)∨Father(x,y)],
then the definition of Grandparent would be reduced to
Grandparent(x,y) ⇔ [∃z Parent(x,z)∧Parent(z,y)]
INDUCTIVE LOGIC PROGRAMMING
INDUCTIVE LOGIC PROGRAMMING

Two principal approaches to ILP:

o Top-down inductive learning method: using a generalization of decision
tree methods
o Inductive learning with inverse deduction: using techniques based on
inverting a resolution proof
INDUCTIVE LOGIC PROGRAMMING

Top-down inductive learning method

Suppose we are trying to learn a definition of the
Grandfather (x, y) predicate
Here are three potential additions:
INDUCTIVE LOGIC PROGRAMMING

Inductive learning with inverse deduction

Inverse resolution is based on the observation that if the example Classifications
follow from Background ∧ Hypothesis ∧ Descriptions, then one must be able to
prove this fact by resolution (because resolution is complete). A family tree
example
PASSIVE REINFORCEMENT LEARNING

 An autonomous agent should learn to choose optimal actions in each

state to achieve its goals
 The agent learns how to achieve that goal by trial-and-error
interactions with its environment
 Passive learning the agent imply watches the world going by and tries
to learn the utilities of being in various states
 Active learning the agent not simply watches, but also acts.
PASSIVE REINFORCEMENT LEARNING

The agent’s policy π is fixed: in state s, it always executes the action π(s).
Its goal is simply to learn how good the policy is—to learn the utility
function Uπ(s).
PASSIVE REINFORCEMENT LEARNING

transition model P’(s|s, a), which specifies the probability of reaching

state s from state s after doing action a;
R(s) it the reward function,
The agent executes a set of trials in the environment using its policy π. In
each trial, the agent starts in state (1,1) and experiences a sequence of
state transitions until it reaches one of the terminal states, (4,2) or (4,3).
Its percepts supply both the current state and the reward received in that
state. Typical trials might look like this:
ACTIVE REINFORCEMENT LEARNING

An active agent must consider

 What action to take?
 What their outcomes maybe?

Update utility equation

APPLICATION OF REINFORCEMENT
LEARNING
Game Playing

1. Checkers program written by Arthur Samuel (1959, 1967)

Samuel first used a weighted linear function for the evaluation of
positions, using up to 16 terms at any one time
2. Backgammon program TD-GAMMON (1992)
The TD-GAMMON project was an attempt to learn from self-play
alone. The only reward signal was given at the end of each game.
TD-GAMMON learned to play considerably better than
NEUROGAMMON, even though the input representation contained
just the raw board position with no computed features. This took
about 200,000 training games and two weeks of computer time.
APPLICATION OF REINFORCEMENT
LEARNING
Robot Control

1. BOXES algorithm (Michie and Chambers 1968)

BOXES was implemented with real cart and pole. The algorithm first
discretized the four-dimensional state space into boxes. Negative
reinforcement was associated with the final action in the final box
and then propagated back through the sequence.
2. PEGASUS algorithm (Bagnell and Schneider, 2001)
Application of reinforcement learning to helicopter flight
SUMMARY
o The use of prior knowledge in learning leads to a picture of
cumulative learning, in which learning agents improve their
learning ability as they acquire more knowledge.
o Explanation-based learning (EBL) extracts general rules from single
examples by explaining the examples and generalizing the
explanation. It provides a deductive method for knowledge into
useful, efficient, special -purpose expertise.
o Relevance-based learning (RBL) uses prior knowledge in the form of
determinations to identify the relevant attributes, thereby
generating a reduced hypothesis space and speeding up learning.
RBL also allows deductive generalizations from single examples.
SUMMARY
o Inductive logic programming (ILP) techniques perform on
knowledge that is expressed in first-order logic. ILP methods can
learn relational knowledge that is not expressible in attribute-based
systems
o The overall agent design dictates the kind of information that must
be learned. The three main designs we covered were the model-
based design, using a model P and a utility function U ; the model-
free design, using an action-utility function Q; and the reflex
design, using a policy π.
o When the learning agent is responsible for selecting actions while it
learns, it must trade off the estimated value of those actions
against the potential for learning useful new information. An exact
solution of the exploration problem is infeasible, but some simple
heuristics do a reasonable job
REFERENCES

Stuart Russell, Peter Norvig,. 2010. Artificial intelligence : a modern

approach. PE. New Jersey. ISBN:9780132071482, Chapter 19
Knowledge in Learning and Human Learning:
http://l3d.cs.colorado.edu/courses/AI-96/learning-2.pdf
Scaling Learning Algorithms towards AI:
http://yann.lecun.com/exdb/publis/pdf/bengio-lecun-07.pdf
https://slideplayer.com/slide/15478257/
https://www.slideshare.net/ersaranya/reinforcement-learning-7313
ThankYOU...

Hypothesis Testing For The Difference of Means
No ratings yet
Hypothesis Testing For The Difference of Means
11 pages
Multiple Choice Questions On Research Methodology - MCQ Biology - Learning Biology Through MCQs
No ratings yet
Multiple Choice Questions On Research Methodology - MCQ Biology - Learning Biology Through MCQs
4 pages
AIML Module - 03
No ratings yet
AIML Module - 03
34 pages
Module 2 Principle of AI
No ratings yet
Module 2 Principle of AI
15 pages
NN DL
No ratings yet
NN DL
1 page
Sigmoid Function: Soft Computing Assignment
100% (1)
Sigmoid Function: Soft Computing Assignment
12 pages
Unit V - AI
No ratings yet
Unit V - AI
41 pages
CS-1351 Artificial Intelligence - Two Marks
100% (1)
CS-1351 Artificial Intelligence - Two Marks
24 pages
UNIT-V NLP
No ratings yet
UNIT-V NLP
25 pages
Cognitive Computing (Course Code: 18CS3272) : CO1 - Session4 Session Topic: The Elements of A Cognitive System
No ratings yet
Cognitive Computing (Course Code: 18CS3272) : CO1 - Session4 Session Topic: The Elements of A Cognitive System
9 pages
Unit 3 AI Srs 13-14
No ratings yet
Unit 3 AI Srs 13-14
45 pages
Unit 4 Knowledge Representation
No ratings yet
Unit 4 Knowledge Representation
13 pages
Thyroid Disease Classification Using Machine Learning Project
No ratings yet
Thyroid Disease Classification Using Machine Learning Project
34 pages
AI-Unit 5
100% (1)
AI-Unit 5
6 pages
Question Bank For CAT1 - 2mks
No ratings yet
Question Bank For CAT1 - 2mks
36 pages
Expert System in AI
No ratings yet
Expert System in AI
11 pages
NLP MCQ 153 Out of 427 - Part One
No ratings yet
NLP MCQ 153 Out of 427 - Part One
30 pages
Lecture 3 Search Strategies in Artificial Intelligence
No ratings yet
Lecture 3 Search Strategies in Artificial Intelligence
18 pages
Unit-6: Pipeline & Vector Processing
No ratings yet
Unit-6: Pipeline & Vector Processing
41 pages
18AI61
No ratings yet
18AI61
3 pages
Chapter 8 - Software Testing
No ratings yet
Chapter 8 - Software Testing
20 pages
Unit 2 (Second Order Methods)
No ratings yet
Unit 2 (Second Order Methods)
9 pages
R22B Tech CSE (AIML) IandIIYearSyllabus PDF
No ratings yet
R22B Tech CSE (AIML) IandIIYearSyllabus PDF
65 pages
Cs8079 - Hci QB Unit 4
No ratings yet
Cs8079 - Hci QB Unit 4
23 pages
Unit-8: Natural Language: Processing
No ratings yet
Unit-8: Natural Language: Processing
16 pages
Representing Knowledge Using
No ratings yet
Representing Knowledge Using
22 pages
What Is NLP?: Natural Language Processing in AI
No ratings yet
What Is NLP?: Natural Language Processing in AI
5 pages
CCS355 Neural Networks and Deep Learning
No ratings yet
CCS355 Neural Networks and Deep Learning
142 pages
NLP Unit 1 Notes
100% (1)
NLP Unit 1 Notes
19 pages
F-IoT Unit-5
No ratings yet
F-IoT Unit-5
50 pages
Unit 5 1
No ratings yet
Unit 5 1
18 pages
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Compiler Design - WWW - Rgpvnotes.in
20 pages
Soft Computing UNIT 3
No ratings yet
Soft Computing UNIT 3
10 pages
1-NLP - Lab Manual
No ratings yet
1-NLP - Lab Manual
15 pages
Unit 4 NLP Notes
No ratings yet
Unit 4 NLP Notes
35 pages
AI Unit 4 QA
No ratings yet
AI Unit 4 QA
22 pages
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
12 pages
Gujarat Technological University: Computer Engineering Machine Learning SUBJECT CODE: 3710216
No ratings yet
Gujarat Technological University: Computer Engineering Machine Learning SUBJECT CODE: 3710216
2 pages
Ai Unit 4
No ratings yet
Ai Unit 4
23 pages
Deep Learning Techniques Notes
No ratings yet
Deep Learning Techniques Notes
42 pages
Learning: Chapter 17: Rich & Knight
No ratings yet
Learning: Chapter 17: Rich & Knight
30 pages
NLP Unit-Ii
No ratings yet
NLP Unit-Ii
118 pages
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
No ratings yet
Question Bank Module-1: Department of Computer Applications 18mca53 - Machine Learning
7 pages
Cp5151 Advanced Data Structures and Algorithims
No ratings yet
Cp5151 Advanced Data Structures and Algorithims
3 pages
Unit-5 Part C 1) Explain The Q Function and Q Learning Algorithm Assuming Deterministic Rewards and Actions With Example. Ans)
No ratings yet
Unit-5 Part C 1) Explain The Q Function and Q Learning Algorithm Assuming Deterministic Rewards and Actions With Example. Ans)
11 pages
Unit-3-Second Chapter
No ratings yet
Unit-3-Second Chapter
9 pages
UNIT 3 Language Modelling
No ratings yet
UNIT 3 Language Modelling
15 pages
CS 3 - Problem Solving Agent
No ratings yet
CS 3 - Problem Solving Agent
80 pages
NLP Lab Manual Updated
No ratings yet
NLP Lab Manual Updated
34 pages
Information Retrieval Systems U6
No ratings yet
Information Retrieval Systems U6
13 pages
Unit 3 - Soft Computing
100% (1)
Unit 3 - Soft Computing
17 pages
UNIT-4 Uncertainty in Artificial Intelligence
No ratings yet
UNIT-4 Uncertainty in Artificial Intelligence
38 pages
Lec01 Conceptlearning
100% (1)
Lec01 Conceptlearning
49 pages
Chapter 16 - MCQ
No ratings yet
Chapter 16 - MCQ
5 pages
Machine Learning QB
No ratings yet
Machine Learning QB
3 pages
Chapter 7
No ratings yet
Chapter 7
49 pages
Artificial Intelligence Module 5
No ratings yet
Artificial Intelligence Module 5
23 pages
Uncertainty AI
No ratings yet
Uncertainty AI
45 pages
NLP Unit-2 Notes
No ratings yet
NLP Unit-2 Notes
45 pages
Textbook of Engineering Chemistry
From Everand
Textbook of Engineering Chemistry
C. Parameswara Murthy
No ratings yet
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
From Everand
The Datadog Handbook: A Guide to Monitoring, Metrics, and Tracing
Robert Johnson
No ratings yet
ML-U5
No ratings yet
ML-U5
6 pages
Experimental Psychology
No ratings yet
Experimental Psychology
36 pages
ECON6001: Applied Econometrics S&W: Chapter 4: Linear Regression With One Regressor, An Introduction Dr. Gedeon Lim
No ratings yet
ECON6001: Applied Econometrics S&W: Chapter 4: Linear Regression With One Regressor, An Introduction Dr. Gedeon Lim
59 pages
06 Kruskal Wallis Test
No ratings yet
06 Kruskal Wallis Test
3 pages
The Great Puzzle: Who in The World Am I? The Quest For Identity in Alice's Adventures in Wonderland
No ratings yet
The Great Puzzle: Who in The World Am I? The Quest For Identity in Alice's Adventures in Wonderland
73 pages
Quantitative and Qualitative Research: Lesson 3
No ratings yet
Quantitative and Qualitative Research: Lesson 3
4 pages
Course Syllabus
No ratings yet
Course Syllabus
9 pages
Regression Statistics
No ratings yet
Regression Statistics
8 pages
Experimental Psychology - Midterm
No ratings yet
Experimental Psychology - Midterm
9 pages
Ajit Kumarroy: Aguidetoresearchmethodology For Beginners
No ratings yet
Ajit Kumarroy: Aguidetoresearchmethodology For Beginners
43 pages
Perhitungan Nilai Kappa
No ratings yet
Perhitungan Nilai Kappa
1 page
Week 3-Chi-Square
No ratings yet
Week 3-Chi-Square
3 pages
MATH 403 EDA Chapter 8
No ratings yet
MATH 403 EDA Chapter 8
19 pages
Hypothesis Testing in The Multiple Regression
No ratings yet
Hypothesis Testing in The Multiple Regression
23 pages
Hasil Analisis Data
No ratings yet
Hasil Analisis Data
8 pages
Managerial Economics
No ratings yet
Managerial Economics
134 pages
MBA Business Statistics 2021
No ratings yet
MBA Business Statistics 2021
9 pages
Daskom A - Athila Zahra A.K - L011191035
No ratings yet
Daskom A - Athila Zahra A.K - L011191035
7 pages
Probability and Statistics ch7
No ratings yet
Probability and Statistics ch7
19 pages
Tutorial 4
No ratings yet
Tutorial 4
5 pages
Multivariate Regression
No ratings yet
Multivariate Regression
20 pages
SPSS Practice T Test
100% (1)
SPSS Practice T Test
3 pages
This Study Resource Was: FS 2: Episode 6: Deductive & Inductive Methods of Teaching
100% (3)
This Study Resource Was: FS 2: Episode 6: Deductive & Inductive Methods of Teaching
6 pages
Multicollinearity
No ratings yet
Multicollinearity
26 pages
Econometrics I: Specification Tests: Dean Fantazzini
No ratings yet
Econometrics I: Specification Tests: Dean Fantazzini
12 pages
Get Narratives of Islamic legal theory 1st Edition Ahmed PDF ebook with Full Chapters Now
100% (11)
Get Narratives of Islamic legal theory 1st Edition Ahmed PDF ebook with Full Chapters Now
67 pages
(Routledge Revivals.) Scheffler, Israel - The Anatomy of Inquiry - Philosophical Studies in The Theory of Science-Routledge (2014)
No ratings yet
(Routledge Revivals.) Scheffler, Israel - The Anatomy of Inquiry - Philosophical Studies in The Theory of Science-Routledge (2014)
499 pages
(eBook PDF) Exploring Family Theories 4th Edition 2024 scribd download
100% (4)
(eBook PDF) Exploring Family Theories 4th Edition 2024 scribd download
56 pages
10 Multicollinearity&Het
No ratings yet
10 Multicollinearity&Het
8 pages