Continual Domain Randomization

Josifovski, Josip; Auddy, Sayantan; Malmir, Mohammadhossein; Piater, Justus; Knoll, Alois; Navarro-Guerrero, Nicolás

Computer Science > Robotics

arXiv:2403.12193 (cs)

[Submitted on 18 Mar 2024 (v1), last revised 27 Aug 2024 (this version, v2)]

Title:Continual Domain Randomization

Authors:Josip Josifovski, Sayantan Auddy, Mohammadhossein Malmir, Justus Piater, Alois Knoll, Nicolás Navarro-Guerrero

View PDF HTML (experimental)

Abstract:Domain Randomization (DR) is commonly used for sim2real transfer of reinforcement learning (RL) policies in robotics. Most DR approaches require a simulator with a fixed set of tunable parameters from the start of the training, from which the parameters are randomized simultaneously to train a robust model for use in the real world. However, the combined randomization of many parameters increases the task difficulty and might result in sub-optimal policies. To address this problem and to provide a more flexible training process, we propose Continual Domain Randomization (CDR) for RL that combines domain randomization with continual learning to enable sequential training in simulation on a subset of randomization parameters at a time. Starting from a model trained in a non-randomized simulation where the task is easier to solve, the model is trained on a sequence of randomizations, and continual learning is employed to remember the effects of previous randomizations. Our robotic reaching and grasping tasks experiments show that the model trained in this fashion learns effectively in simulation and performs robustly on the real robot while matching or outperforming baselines that employ combined randomization or sequential randomization without continual learning. Our code and videos are available at this https URL.

Comments:	Accepted at IROS 2024. Equal contribution from first two authors
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2403.12193 [cs.RO]
	(or arXiv:2403.12193v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2403.12193

Submission history

From: Sayantan Auddy [view email]
[v1] Mon, 18 Mar 2024 19:04:56 UTC (1,158 KB)
[v2] Tue, 27 Aug 2024 16:10:42 UTC (1,425 KB)

Computer Science > Robotics

Title:Continual Domain Randomization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Continual Domain Randomization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators