Ai Chapter2 Intelligent Agents Aima
Ai Chapter2 Intelligent Agents Aima
CHAPTER 2
Stuart Russell and Peter Norvig, Artificial
Intelligence: A Modern Approach, Global
Edition 3/E
Outline
2
• Human agent:
– eyes, ears, and other organs for sensors;
– hands, legs, mouth, and other body parts for actuators
• Robotic agent:
– cameras and infrared range finders for sensors
– various motors for actuators
Demos:
http://aima.cs.berkeley.edu/demos.html
• Rationality
– Performance measuring success
– Agents prior knowledge of environment
– Actions that agent can perform
– Agent’s percept sequence to date
• Examples:
– Part-picking robot
⚫ Non-deterministic environments
⚪ Have aspects beyond the control of the agent
⚪ Utility functions have to guess at changes in world
Cross Word Poker Backgammon Taxi Part picking robot Image analysis
Deterministic Stochastic Stochastic driver
Stochastic Stochastic Deterministic
⚫ In non-episodic environments:
⚪ Agent has to plan ahead:
Current choice will affect future actions
Cross Word Poker Backgammon Taxi Part picking robot Image analysis
Sequential Sequential Sequential driver
Sequential Episodic Episodic
Cross Word Poker Backgammon Taxi Part picking robot Image analysis
driver Conti
Discrete Discrete Discrete Conti Conti
Cross Word Poker Backgammon Taxi Part picking robot Image analysis
Single driver Single
Multi Multi Multi Single
Fully
Observable
yes
no
Deterministic no
yes
Certainty: Uncertainty
Search
• Have a goal
⚪ A destination to get to
• Reflex agent breaks when it sees brake lights. Goal based agent
reasons
– Brake light -> car in front is stopping -> I should stop -> I should use brake
⚫ Performance element is
what was previously the
whole agent
⚫ Input sensor
⚫ Output action
⚫ Learning element
⚫ Modifies performance
element.
⚫ Problem generator
⚫ Tries to solve the problem
differently instead of
optimizing.
⚫ Suggests exploring new
actions -> new problems.
⚪ Performance element
How it currently drives
⚪ Taxi driver Makes quick left turn across 3 lanes
Critics observe shocking language by passenger and other drivers
and informs bad action
Learning element tries to modify performance elements for future
Problem generator suggests experiment out something called
Brakes on different Road conditions
⚪ Exploration vs. Exploitation
Learning experience can be costly in the short run
shocking language from other drivers
Less tip
Fewer passengers
Planning
Decision Theory
Action Reinforcement
Learning
Game Theory
Knowledge Learning
Logic Machine Learning
Probability Statistics
Heuristics
Inference
Action
Reinforcement
Learning
Learning