A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Mitavskiy, Boris; Rowe, Jonathan; Cannings, Chris

Computer Science > Artificial Intelligence

arXiv:1110.4657 (cs)

[Submitted on 20 Oct 2011]

Title:A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Authors:Boris Mitavskiy, Jonathan Rowe, Chris Cannings

View PDF

Abstract:Purpose: In recent years Monte-Carlo sampling methods, such as Monte Carlo tree search, have achieved tremendous success in model free reinforcement learning. A combination of the so called upper confidence bounds policy to preserve the "exploration vs. exploitation" balance to select actions for sample evaluations together with massive computing power to store and to update dynamically a rather large pre-evaluated game tree lead to the development of software that has beaten the top human player in the game of Go on a 9 by 9 board. Much effort in the current research is devoted to widening the range of applicability of the Monte-Carlo sampling methodology to partially observable Markov decision processes with non-immediate payoffs. The main challenge introduced by randomness and incomplete information is to deal with the action evaluation at the chance nodes due to drastic differences in the possible payoffs the same action could lead to. The aim of this article is to establish a version of a theorem that originated from population genetics and has been later adopted in evolutionary computation theory that will lead to novel Monte-Carlo sampling algorithms that provably increase the AI potential. Due to space limitations the actual algorithms themselves will be presented in the sequel papers, however, the current paper provides a solid mathematical foundation for the development of such algorithms and explains why they are so promising.

Comments:	53 pages in size. This work has been recently submitted to the IJICC (International Journal on Intelligent Computing and Cybernetics)
Subjects:	Artificial Intelligence (cs.AI); Discrete Mathematics (cs.DM)
Cite as:	arXiv:1110.4657 [cs.AI]
	(or arXiv:1110.4657v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1110.4657

Submission history

From: Boris Mitavskiy [view email]
[v1] Thu, 20 Oct 2011 22:53:03 UTC (60 KB)

Computer Science > Artificial Intelligence

Title:A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators