Cooperative Deep $Q$-learning Framework for Environments Providing Image Feedback

Raghavan, Krishnan; Narayanan, Vignesh; Sarangapani, Jagannathan

Electrical Engineering and Systems Science > Systems and Control

arXiv:2110.15305 (eess)

[Submitted on 28 Oct 2021]


Authors: Krishnan Raghavan, Vignesh Narayanan, Jagannathan Sarangapani


Abstract: In this paper, we address two key challenges in the deep reinforcement learning setting, sample inefficiency and slow learning, with a dual neural network (NN)-driven learning approach. In the proposed approach, we use two deep NNs with independent initialization to robustly approximate the action-value function in the presence of image inputs. In particular, we develop a temporal difference (TD) error-driven learning approach, in which we introduce a set of linear transformations of the TD error to directly update the parameters of each layer in the deep NN. We demonstrate theoretically that the cost minimized by the error-driven learning (EDL) regime is an approximation of the empirical cost and that the approximation error decreases as learning progresses, irrespective of the size of the network. Using simulation analysis, we show that the proposed method enables faster learning and convergence and requires a reduced buffer size (thereby increasing sample efficiency).
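The abstract does not spell out the update rule. As a rough illustration of the quantity the approach is built around, the sketch below computes the standard one-step Q-learning TD error using two independently initialized estimators. The tabular representation, the function names (`make_q_table`, `td_error`, `td_update`), and the choice to let one estimator bootstrap from the other are assumptions made for illustration only; the layer-wise linear transformations of the TD error described in the abstract are not reproduced here.

```python
import random


def make_q_table(n_states, n_actions, seed):
    """Hypothetical stand-in for one Q estimator: a table with its own random init."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_actions)]
            for _ in range(n_states)]


def td_error(q_learner, q_target, state, action, reward, next_state, done,
             gamma=0.99):
    """Standard one-step Q-learning TD error, bootstrapping from q_target."""
    # No bootstrap term on terminal transitions.
    bootstrap = 0.0 if done else gamma * max(q_target[next_state])
    return reward + bootstrap - q_learner[state][action]


def td_update(q_learner, q_target, transition, lr=0.1, gamma=0.99):
    """Move q_learner[state][action] a step of size lr along the TD error."""
    state, action, reward, next_state, done = transition
    delta = td_error(q_learner, q_target, state, action, reward, next_state,
                     done, gamma)
    q_learner[state][action] += lr * delta
    return delta


if __name__ == "__main__":
    # Two estimators with independent initialization (different seeds).
    q_a = make_q_table(n_states=3, n_actions=2, seed=0)
    q_b = make_q_table(n_states=3, n_actions=2, seed=1)
    # One made-up transition: (state, action, reward, next_state, done).
    delta = td_update(q_a, q_b, (0, 1, 1.0, 2, False))
    print("TD error:", delta)
```

In a deep NN setting the tables would be replaced by networks over image inputs, and the scalar update by whatever parameter update the method prescribes; the TD error itself is computed the same way.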

Subjects:	Systems and Control (eess.SY); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2110.15305 [eess.SY]
	(or arXiv:2110.15305v1 [eess.SY] for this version)
	https://doi.org/10.48550/arXiv.2110.15305

Submission history

From: Raghavan Krishnan
[v1] Thu, 28 Oct 2021 17:12:41 UTC (1,982 KB)
