Artificial Intelligence and Intelligent Agents (F29AI) MDP I: Intro To Markov Decision Processes
Based on slides from Ioannis Konstas @HWU, Verena Rieser @HWU, Dan Klein @UC Berkeley
So far…
• How to formalise a real-world problem?
• How to reach a goal?
• Blind search: DFS, BFS, UCS
• Informed search: Greedy, A*
• How to maximise an outcome if there is an adversary?
• Adversarial search: minimax, alpha-beta pruning
• What to do if this adversary (environment) is probabilistic?
• Expectimax, maximum expected utility
Today
• Non-deterministic worlds
• Markov Decision Processes (MDPs)
• Time-limited values
Andrey Markov
(1856-1922)