Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning

Kearney, Alex; Veeriah, Vivek; Travnik, Jaden; Pilarski, Patrick M.; Sutton, Richard S.

Computer Science > Machine Learning

arXiv:1903.03252 (cs)

[Submitted on 8 Mar 2019]

Title:Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning

Authors:Alex Kearney, Vivek Veeriah, Jaden Travnik, Patrick M. Pilarski, Richard S. Sutton

View PDF

Abstract:There is a long history of using meta learning as representation learning, specifically for determining the relevance of inputs. In this paper, we examine an instance of meta-learning in which feature relevance is learned by adapting step size parameters of stochastic gradient descent---building on a variety of prior work in stochastic approximation, machine learning, and artificial neural networks. In particular, we focus on stochastic meta-descent introduced in the Incremental Delta-Bar-Delta (IDBD) algorithm for setting individual step sizes for each feature of a linear function approximator. Using IDBD, a feature with large or small step sizes will have a large or small impact on generalization from training examples. As a main contribution of this work, we extend IDBD to temporal-difference (TD) learning---a form of learning which is effective in sequential, non i.i.d. problems. We derive a variety of IDBD generalizations for TD learning, demonstrating that they are able to distinguish which features are relevant and which are not. We demonstrate that TD IDBD is effective at learning feature relevance in both an idealized gridworld and a real-world robotic prediction task.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1903.03252 [cs.LG]
	(or arXiv:1903.03252v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1903.03252

Submission history

From: Alex Kearney [view email]
[v1] Fri, 8 Mar 2019 02:29:22 UTC (8,931 KB)

Computer Science > Machine Learning

Title:Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Feature Relevance Through Step Size Adaptation in Temporal-Difference Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators