Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Incremental State Aggregation for Value Function Estimation in Reinforcement Learning

Published: 01 October 2011 Publication History

Abstract

In reinforcement learning, large state and action spaces make the estimation of value functions impractical, so a value function is often represented as a linear combination of basis functions whose linear coefficients constitute parameters to be estimated. However, preparing basis functions requires a certain amount of prior knowledge and is, in general, a difficult task. To overcome this difficulty, an adaptive basis function construction technique has been proposed by Keller et. al. recently, but it requires excessive computational cost. We propose an efficient approach to this difficulty, in which the problem of approximating the value function is decomposed into a number of subproblems, each of which can be solved with small computational cost. Computer experiments show that the CPU time needed by our method is much smaller than that by the existing method.

Cited By

View all
  • (2023)Automated Design of Metaheuristics Using Reinforcement Learning Within a Novel General Search FrameworkIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.319729827:4(1072-1084)Online publication date: 1-Aug-2023
  • (2018)Towards 5G: A Reinforcement Learning-Based Scheduling Solution for Data Traffic ManagementIEEE Transactions on Network and Service Management10.1109/TNSM.2018.286356315:4(1661-1675)Online publication date: 1-Dec-2018
  • (2013)Backward Q-learningEngineering Applications of Artificial Intelligence10.1016/j.engappai.2013.06.01626:9(2184-2193)Online publication date: 1-Oct-2013

Recommendations

Comments

Information & Contributors

Information

Published In

cover image IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics  Volume 41, Issue 5
October 2011
267 pages

Publisher

IEEE Press

Publication History

Published: 01 October 2011

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Automated Design of Metaheuristics Using Reinforcement Learning Within a Novel General Search FrameworkIEEE Transactions on Evolutionary Computation10.1109/TEVC.2022.319729827:4(1072-1084)Online publication date: 1-Aug-2023
  • (2018)Towards 5G: A Reinforcement Learning-Based Scheduling Solution for Data Traffic ManagementIEEE Transactions on Network and Service Management10.1109/TNSM.2018.286356315:4(1661-1675)Online publication date: 1-Dec-2018
  • (2013)Backward Q-learningEngineering Applications of Artificial Intelligence10.1016/j.engappai.2013.06.01626:9(2184-2193)Online publication date: 1-Oct-2013

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media