If you have planned to achieve one particular goal in a stochastic delayed rewards problem and then someone asks about a different goal what should.
This paper shows that by using a new kind of automatically generated abstract action hierarchy that with N states, preparing for all of N possible goals ...
In this paper we will use the terminology that a multipolicy π*(x, y) (for all state-pairs (x, y)) maps a state x to the first action it should take in order to ...
Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs. Authors: Andrew W. Moore.
Bibliographic details on Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs.
@InProceedings{LIS79, title = {Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal {MDP}s}, author = {Andrew Moore and Leemon ...
Multi-goal MDPs (which this hierarchy is developed for) are ... Multi-value-functions: Efficient automatic action hierarchies for multiple goal MDPs.
Multi-value-functions: Efficient automatic action hierarchies for multiple goal MDPs. In International Joint Conference on Artificial Intelligence, pp ...
Jan 17, 2024 · The figure reveals how the corresponding skill hierarchy would enable efficient navigation of the environment at multiple time scales.