Author: Daley, Brett : Search

Applied Filters

People

Conferences

Publication Date

6 Results for: Author: Daley, BrettEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,765,699 records)|Limit your search to The ACM Full-Text Collection (758,513 records)

Showing 1 - 6of6 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
July 2023
Trajectory-aware eligibility traces for off-policy reinforcement learning
ICML'23: Proceedings of the 40th International Conference on Machine LearningArticle No.: 273, Pages 6818–6835

Off-policy learning from multistep returns is crucial for sample-efficient reinforcement learning, but counteracting off-policy bias without exacerbating variance is challenging. Classically, off-policy bias is corrected in a per-decision manner: past ...
0
Metrics
Total Citations0
article
Free
June 2023
On Centralized Critics in Multi-Agent Reinforcement Learning
Journal of Artificial Intelligence Research (JAIR), Volume 77https://doi.org/10.1613/jair.1.14386
Centralized Training for Decentralized Execution, where agents are trained offline in a centralized fashion and execute online in a decentralized manner, has become a popular approach in Multi-Agent Reinforcement Learning (MARL). In particular, it has ...
0
296
Metrics
Total Citations0
Total Downloads296
Last 12 Months248
Last 6 weeks28
View online with eReader
PDF
extended-abstract
Public Access
May 2021
Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning
AAMAS '21: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent SystemsPages 1486–1488

Deep Reinforcement Learning (RL) methods rely on experience replay to approximate the minibatched supervised learning setting; however, unlike supervised learning where access to lots of training data is crucial to generalization, replay-based deep RL ...
0
35
Metrics
Total Citations0
Total Downloads35
Last 12 Months10
Last 6 weeks5
View online with eReader
PDF
research-article
Public Access
May 2021
Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning
AAMAS '21: Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent SystemsPages 844–852

Centralized Training for Decentralized Execution, where agents are trained offline using centralized information but execute in a decentralized manner online, has gained popularity in the multi-agent reinforcement learning community. In particular, actor-...
5
74
Metrics
Total Citations5
Total Downloads74
Last 12 Months20
Last 6 weeks4
View online with eReader
PDF
research-article
Free
December 2019
Reconciling λ-returns with experience replay
- Brett Daley,
- Christopher Amato
NIPS'19: Proceedings of the 33rd International Conference on Neural Information Processing SystemsDecember 2019, Article No.: 102, Pages 1133–1142

Modern deep reinforcement learning methods have departed from the incremental learning required for eligibility traces, rendering the implementation of the λ-return difficult in this context. In particular, off-policy methods that utilize experience ...
0
24
Metrics
Total Citations0
Total Downloads24
Last 12 Months18
Last 6 weeks7
View online with eReader
PDF
research-article
Free
January 2015
NUPAR: A Benchmark Suite for Modern GPU Architectures
ICPE '15: Proceedings of the 6th ACM/SPEC International Conference on Performance EngineeringPages 253–264https://doi.org/10.1145/2668930.2688046

Heterogeneous systems consisting of multi-core CPUs, Graphics Processing Units (GPUs) and many-core accelerators have gained widespread use by application developers and data-center platform developers. Modern day heterogeneous systems have evolved to ...
25
844
Metrics
Total Citations25
Total Downloads844
Last 12 Months183
Last 6 weeks28
View online with eReader
PDF

Applied Filters

People

Names

Institutions

Authors

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Trajectory-aware eligibility traces for off-policy reinforcement learning

On Centralized Critics in Multi-Agent Reinforcement Learning

Stratified Experience Replay: Correcting Multiplicity Bias in Off-Policy Reinforcement Learning

Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning

Reconciling λ-returns with experience replay

NUPAR: A Benchmark Suite for Modern GPU Architectures