Open Loop Execution of Tree-Search Algorithms, extended version

Lecarpentier, Erwan; Infantes, Guillaume; Lesire, Charles; Rachelson, Emmanuel

Computer Science > Machine Learning

arXiv:1805.01367 (cs)

[Submitted on 3 May 2018 (v1), last revised 12 Feb 2019 (this version, v2)]

Title:Open Loop Execution of Tree-Search Algorithms, extended version

Authors:Erwan Lecarpentier, Guillaume Infantes, Charles Lesire, Emmanuel Rachelson

View PDF

Abstract:In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a distinction between node-wise optimality and state-wise optimality. Finally, we empirically demonstrate that our method achieves a compromise between loss of performance and computational gain.

Comments:	10 pages, 10 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.01367 [cs.LG]
	(or arXiv:1805.01367v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.01367
Journal reference:	27th International Joint Conference on Artificial Intelligence (IJCAI 2018)

Submission history

From: Erwan Lecarpentier [view email]
[v1] Thu, 3 May 2018 15:20:10 UTC (325 KB)
[v2] Tue, 12 Feb 2019 21:42:21 UTC (325 KB)

Computer Science > Machine Learning

Title:Open Loop Execution of Tree-Search Algorithms, extended version

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Open Loop Execution of Tree-Search Algorithms, extended version

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators