Furkan Burak mutlu

Bilkent University, Electrical and Electronics Engineering, Undergraduate

Followers

Following

Public Views

Interests

Uploads

Papers by Furkan Burak mutlu

Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

Download

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)

Download

Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)

Download

AWD3: Dynamic Reduction of the Estimation Bias

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI), 2021

Download

Parameter-Free Deterministic Reduction of the Estimation Bias in Continuous Control

ArXiv, 2021

Approximation of the value functions in valuebased deep reinforcement learning systems induces ov... more Approximation of the value functions in valuebased deep reinforcement learning systems induces overestimation bias, resulting in suboptimal policies. We show that when the reinforcement signals received by the agents have a high variance, deep actor-critic approaches that overcome the overestimation bias lead to a substantial underestimation bias. We introduce a parameter-free, novel deep Q-learning variant to reduce this underestimation bias for continuous control. By obtaining fixed weights in computing the critic objective as a linear combination of the approximate critic functions, our Q-value update rule integrates the concepts of Clipped Double Q-learning and Maxmin Q-learning. We test the performance of our improvement on a set of MuJoCo and Box2D continuous control tasks and find that it improves the state-of-the-art and outperforms the baseline algorithms in the majority of the environments.

Download

Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

Download

Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic Methods

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)

Download

Off-Policy Correction for Deep Deterministic Policy Gradient Algorithms via Batch Prioritized Experience Replay

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI)

Download

AWD3: Dynamic Reduction of the Estimation Bias

2021 IEEE 33rd International Conference on Tools with Artificial Intelligence (ICTAI), 2021

Download

Parameter-Free Deterministic Reduction of the Estimation Bias in Continuous Control

ArXiv, 2021

Download

Furkan Burak mutlu

Uploads

Papers by Furkan Burak mutlu

Log In