Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/340534.340892guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Gradient descent for general reinforcement learning

Published: 20 July 1999 Publication History

Abstract

No abstract available.

Cited By

View all
  • (2023)General munchausen reinforcement learning with tsallis kullback-leibler divergenceProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668636(57639-57659)Online publication date: 10-Dec-2023
  • (2019)Recursive least-squares temporal difference learning for adaptive traffic signal control at intersectionNeural Computing and Applications10.1007/s00521-017-3066-931:2(1013-1028)Online publication date: 1-Feb-2019
  • (2018)Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vesselsNeurocomputing10.1016/j.neucom.2017.06.066272:C(63-73)Online publication date: 10-Jan-2018
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Guide Proceedings
Proceedings of the 1998 conference on Advances in neural information processing systems II
July 1999
1090 pages
ISBN:0262112450

Publisher

MIT Press

Cambridge, MA, United States

Publication History

Published: 20 July 1999

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)General munchausen reinforcement learning with tsallis kullback-leibler divergenceProceedings of the 37th International Conference on Neural Information Processing Systems10.5555/3666122.3668636(57639-57659)Online publication date: 10-Dec-2023
  • (2019)Recursive least-squares temporal difference learning for adaptive traffic signal control at intersectionNeural Computing and Applications10.1007/s00521-017-3066-931:2(1013-1028)Online publication date: 1-Feb-2019
  • (2018)Concise deep reinforcement learning obstacle avoidance for underactuated unmanned marine vesselsNeurocomputing10.1016/j.neucom.2017.06.066272:C(63-73)Online publication date: 10-Jan-2018
  • (2017)An alternative softmax operator for reinforcement learningProceedings of the 34th International Conference on Machine Learning - Volume 7010.5555/3305381.3305407(243-252)Online publication date: 6-Aug-2017
  • (2015)Session Search by Direct Policy LearningProceedings of the 2015 International Conference on The Theory of Information Retrieval10.1145/2808194.2809461(261-270)Online publication date: 27-Sep-2015
  • (2015)Reinforcement Learning for Thermal-aware Many-core Task AllocationProceedings of the 25th edition on Great Lakes Symposium on VLSI10.1145/2742060.2742078(379-384)Online publication date: 20-May-2015
  • (2015)Evolutionary Bilevel Optimization for Complex Control TasksProceedings of the 2015 Annual Conference on Genetic and Evolutionary Computation10.1145/2739480.2754732(871-878)Online publication date: 11-Jul-2015
  • (2015)Deep learning in neural networksNeural Networks10.1016/j.neunet.2014.09.00361:C(85-117)Online publication date: 1-Jan-2015
  • (2011)Improving Gaussian process value function approximation in policy gradient algorithmsProceedings of the 21st international conference on Artificial neural networks - Volume Part II10.5555/2029604.2029633(221-228)Online publication date: 14-Jun-2011
  • (2011)Gradient based algorithms with loss functions and kernels for improved on-policy controlProceedings of the 9th European conference on Recent Advances in Reinforcement Learning10.1007/978-3-642-29946-9_7(30-41)Online publication date: 9-Sep-2011
  • Show More Cited By

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media