Efficient Transformers in Reinforcement Learning using Actor-Learner Distillation

This work develops Actor-Learner Distillation, a continual distillation procedure that transfers learning progress from a large-capacity learner model to a small-capacity actor model in partially observable environments.

April 4, 2021
Citations: 25
by Emilio Parisotto and Ruslan Salakhutdinov
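
The procedure summarized above lends itself to a compact illustration. Below is a minimal sketch of the continual distillation step, assuming a PyTorch setting with a discrete action space. The names PolicyNet and distill_step are hypothetical, plain MLPs stand in for the paper's larger learner and smaller actor architectures, and the RL training of the learner itself is omitted.

import torch
import torch.nn as nn
import torch.nn.functional as F

class PolicyNet(nn.Module):
    """Hypothetical policy network mapping observations to action logits."""
    def __init__(self, obs_dim, n_actions, hidden):
        super().__init__()
        self.body = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.pi = nn.Linear(hidden, n_actions)

    def forward(self, obs):
        return self.pi(self.body(obs))

def distill_step(actor, learner, obs, optimizer):
    """One continual-distillation update: pull the small actor's policy
    toward the large learner's policy on a batch of observations."""
    with torch.no_grad():
        teacher_logits = learner(obs)  # the learner acts as teacher
    student_logits = actor(obs)
    # KL(teacher || student) over the action distribution.
    loss = F.kl_div(
        F.log_softmax(student_logits, dim=-1),
        F.softmax(teacher_logits, dim=-1),
        reduction="batchmean",
    )
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()

# Usage: the small actor gathers experience cheaply while the large
# learner is trained with RL elsewhere; repeated distill_step calls
# keep the actor tracking the learner's progress.
obs_dim, n_actions = 16, 4
actor = PolicyNet(obs_dim, n_actions, hidden=32)     # small capacity
learner = PolicyNet(obs_dim, n_actions, hidden=256)  # large capacity
opt = torch.optim.Adam(actor.parameters(), lr=1e-3)
batch = torch.randn(64, obs_dim)  # stand-in for replayed observations
print(distill_step(actor, learner, batch, opt))

In the paper's setting the learner is a transformer and the actor a smaller recurrent model, so the distillation targets would be computed over trajectories rather than independent observations; the i.i.d. batch here is a simplification.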