Artificial Intelligence and Intelligent Agents (F29AI) MDP I: Intro To Markov Decision Processes

The document discusses Markov Decision Processes (MDPs) which provide a framework for modeling sequential decision making problems under uncertainty. MDPs are defined by states, actions, transition probabilities between states given an action, and reward functions. They allow planning for optimal decisions even when actions have non-deterministic outcomes defined by probabilities. An example of a noisy "grid world" environment is provided where the agent's movement actions do not always produce the intended result. MDPs generalize search problems to stochastic domains and can be solved using techniques like expectimax search or other MDP-specific algorithms.

Artificial Intelligence and Intelligent Agents (F29AI)
MDP I: Intro to Markov Decision Processes
Arash Eshghi

Based on slides from Ioannis Konstas @HWU, Verena Rieser @HWU, Dan Klein @UC Berkeley
So far…
• How to formalise a real-world problem?
• How to reach a goal?
  • Blind search: DFS, BFS, UCS
  • Informed search: Greedy, A*
• How to maximise an outcome if there is an adversary?
  • Adversarial search: minimax, alpha-beta pruning
• What to do if this adversary (environment) is probabilistic?
  • Expectimax, maximum expected utilities
Today
• Non-deterministic worlds
• Markov Decision Processes (MDPs)
• Time-limited values

How can you plan optimally if your actions are non-deterministic?
(You only have a distribution over the outcome.)
Example: Grid World
• A maze-like problem
  • The agent lives in a grid
  • Walls block the agent’s path
• Noisy movement: actions do not always go as planned
  • 80% of the time, the action North takes the agent North (if there is no wall there)
  • 10% of the time, North takes the agent West; 10% East
  • If there is a wall in the direction the agent would have been taken, the agent stays put
• The agent receives rewards each time step
  • Small “living” reward each step (can be negative)
  • Big rewards come at the end (good or bad)
• Goal: maximize sum of rewards
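The noisy movement rule above can be written down as a small transition function. The following is a minimal sketch, not the course's implementation: the 3x4 grid size, the wall position, treating the grid edge like a wall, and extending the 80/10/10 rule from North to the other three actions (slipping to the two perpendicular directions) are all assumptions made for illustration.

```python
# Hypothetical layout: the slides do not give grid dimensions or wall positions.
ROWS, COLS = 3, 4
WALLS = {(1, 1)}

# Direction vectors as (row delta, column delta); row 0 is the top row.
MOVES = {"N": (-1, 0), "S": (1, 0), "E": (0, 1), "W": (0, -1)}
# Assumed slip directions: the two directions perpendicular to the intended one.
SLIPS = {"N": ("W", "E"), "S": ("E", "W"), "E": ("N", "S"), "W": ("S", "N")}


def step(state, direction):
    """Move one cell in `direction`; stay put if a wall or the grid edge blocks it."""
    r, c = state
    dr, dc = MOVES[direction]
    nxt = (r + dr, c + dc)
    in_grid = 0 <= nxt[0] < ROWS and 0 <= nxt[1] < COLS
    if not in_grid or nxt in WALLS:
        return state
    return nxt


def transition(state, action):
    """Return {next_state: probability} under the 80/10/10 noise model."""
    left, right = SLIPS[action]
    dist = {}
    for direction, prob in ((action, 0.8), (left, 0.1), (right, 0.1)):
        nxt = step(state, direction)
        # Several directions can lead to the same cell (e.g. when blocked),
        # so probabilities are accumulated rather than overwritten.
        dist[nxt] = dist.get(nxt, 0.0) + prob
    return dist


if __name__ == "__main__":
    print(transition((2, 0), "N"))  # bottom-left corner, intended move North
```

From the bottom-left corner, the West slip is blocked by the grid edge, so its 10% probability mass stays on the current cell.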
Grid World Actions
[Figure: Deterministic Grid World vs. Stochastic Grid World]
Markov Decision Processes

• An MDP is defined by:
  • A set of states s ∈ S
  • A set of actions a ∈ A
  • A transition function T(s, a, s’)
    • Probability that action a from s leads to s’, i.e., P(s’ | s, a)
    • Also called “the model”
  • A reward function R(s, a, s’)
    • Sometimes just R(s) or R(s’)
  • A start state (or distribution)
  • Maybe a terminal state

• MDPs are a family of non-deterministic search problems
  • One way to solve them is with expectimax search, but we will have a new tool soon
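The components listed above map directly onto a small data structure, and expectimax over it gives a depth-limited value. The sketch below is illustrative only: the `MDP` class, the `expectimax_value` function, and the three-state toy problem (`s0`, `goal`, `pit`, with its probabilities and rewards) are hypothetical names and numbers, not taken from the slides. Rewards are summed without discounting, matching the "maximize sum of rewards" goal stated for Grid World.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Hashable, List, Set

State = Hashable
Action = str


@dataclass
class MDP:
    states: List[State]
    actions: Callable[[State], List[Action]]          # actions available in s
    T: Callable[[State, Action], Dict[State, float]]  # T(s, a) -> {s': P(s' | s, a)}
    R: Callable[[State, Action, State], float]        # R(s, a, s')
    start: State
    terminals: Set[State]


def expectimax_value(mdp: MDP, s: State, depth: int) -> float:
    """Best expected sum of rewards from s with at most `depth` actions left."""
    if depth == 0 or s in mdp.terminals:
        return 0.0
    # Max node over actions, chance node (expectation) over outcomes s2.
    return max(
        sum(p * (mdp.R(s, a, s2) + expectimax_value(mdp, s2, depth - 1))
            for s2, p in mdp.T(s, a).items())
        for a in mdp.actions(s)
    )


# Hypothetical toy problem: "safe" earns 1 and stays in s0; "risky" ends the
# episode at the goal (+10) with probability 0.8 or in the pit (-10) otherwise.
T_TABLE = {
    ("s0", "safe"): {"s0": 1.0},
    ("s0", "risky"): {"goal": 0.8, "pit": 0.2},
}
R_TABLE = {
    ("s0", "safe", "s0"): 1.0,
    ("s0", "risky", "goal"): 10.0,
    ("s0", "risky", "pit"): -10.0,
}
toy = MDP(
    states=["s0", "goal", "pit"],
    actions=lambda s: ["safe", "risky"] if s == "s0" else [],
    T=lambda s, a: T_TABLE[(s, a)],
    R=lambda s, a, s2: R_TABLE[(s, a, s2)],
    start="s0",
    terminals={"goal", "pit"},
)

if __name__ == "__main__":
    for depth in (1, 2):
        print(depth, expectimax_value(toy, toy.start, depth))
```

With one step left the risky action is best (expected reward 0.8 × 10 − 0.2 × 10 = 6); with two steps left it pays to take the safe action first and the risky one afterwards (1 + 6 = 7). Plain recursion like this recomputes the same states repeatedly, which is one reason a different tool is worth having for larger problems.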
[Video: Gridworld demo, manual intro]
What is Markov about MDPs?

• “Markov” generally means that given the present state, the future and the past are independent.
• For Markov decision processes, “Markov” means action outcomes depend only on the current state and the chosen action, not on the history that led there.

[Figure: Andrey Markov (1856-1922)]
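Written as an equation, the Markov property for an MDP says that conditioning on the full history adds nothing beyond the current state and action. The random-variable notation S_t, A_t (state and action at time step t) is introduced here for illustration and does not appear in the slides; the right-hand side is the transition function T(s_t, a_t, s') from the MDP definition.

```latex
P\left(S_{t+1} = s' \mid S_t = s_t, A_t = a_t, S_{t-1} = s_{t-1}, A_{t-1} = a_{t-1}, \ldots, S_0 = s_0\right)
  = P\left(S_{t+1} = s' \mid S_t = s_t, A_t = a_t\right)
  = T(s_t, a_t, s')
```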
