Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–21 of 21 results for author: Arulkumaran, K

Searching in archive cs. Search in all archives.
.
  1. Preference-Learning Emitters for Mixed-Initiative Quality-Diversity Algorithms

    Authors: Roberto Gallotta, Kai Arulkumaran, L. B. Soros

    Abstract: In mixed-initiative co-creation tasks, wherein a human and a machine jointly create items, it is important to provide multiple relevant suggestions to the designer. Quality-diversity algorithms are commonly used for this purpose, as they can provide diverse suggestions that represent salient areas of the solution space, showcasing designs with high fitness and wide variety. Because generated sugge… ▽ More

    Submitted 14 April, 2023; v1 submitted 25 October, 2022; originally announced October 2022.

  2. Surrogate Infeasible Fitness Acquirement FI-2Pop for Procedural Content Generation

    Authors: Roberto Gallotta, Kai Arulkumaran, L. B. Soros

    Abstract: When generating content for video games using procedural content generation (PCG), the goal is to create functional assets of high quality. Prior work has commonly leveraged the feasible-infeasible two-population (FI-2Pop) constrained optimisation algorithm for PCG, sometimes in combination with the multi-dimensional archive of phenotypic-elites (MAP-Elites) algorithm for finding a set of diverse… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  3. arXiv:2204.05133  [pdf, other

    cs.AI cs.NE

    On the link between conscious function and general intelligence in humans and machines

    Authors: Arthur Juliani, Kai Arulkumaran, Shuntaro Sasai, Ryota Kanai

    Abstract: In popular media, there is often a connection drawn between the advent of awareness in artificial agents and those same agents simultaneously achieving human or superhuman level intelligence. In this work, we explore the validity and potential application of this seemingly intuitive link between consciousness and intelligence. We do so by examining the cognitive abilities associated with three con… ▽ More

    Submitted 19 July, 2022; v1 submitted 23 March, 2022; originally announced April 2022.

  4. arXiv:2202.12742  [pdf, other

    cs.LG cs.AI

    Learning Relative Return Policies With Upside-Down Reinforcement Learning

    Authors: Dylan R. Ashley, Kai Arulkumaran, Jürgen Schmidhuber, Rupesh Kumar Srivastava

    Abstract: Lately, there has been a resurgence of interest in using supervised learning to solve reinforcement learning problems. Recent work in this area has largely focused on learning command-conditioned policies. We investigate the potential of one such method -- upside-down reinforcement learning -- to work with commands that specify a desired relationship between some scalar value and the observed retu… ▽ More

    Submitted 10 May, 2022; v1 submitted 23 February, 2022; originally announced February 2022.

    Comments: presented at the 5th Multidisciplinary Conference on Reinforcement Learning and Decision Making; 5 pages in main text, 2 figures in main text

    ACM Class: I.2.6

  5. arXiv:2202.11960  [pdf, other

    cs.LG cs.AI

    All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL

    Authors: Kai Arulkumaran, Dylan R. Ashley, Jürgen Schmidhuber, Rupesh K. Srivastava

    Abstract: Upside down reinforcement learning (UDRL) flips the conventional use of the return in the objective function in RL upside down, by taking returns as input and predicting actions. UDRL is based purely on supervised learning, and bypasses some prominent issues in RL: bootstrapping, off-policy corrections, and discount factors. While previous work with UDRL demonstrated it in a traditional online RL… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

  6. Diversity-based Trajectory and Goal Selection with Hindsight Experience Replay

    Authors: Tianhong Dai, Hengyan Liu, Kai Arulkumaran, Guangyu Ren, Anil Anthony Bharath

    Abstract: Hindsight experience replay (HER) is a goal relabelling technique typically used with off-policy deep reinforcement learning algorithms to solve goal-oriented tasks; it is well suited to robotic manipulation tasks that deliver only sparse rewards. In HER, both trajectories and transitions are sampled uniformly for training. However, not all of the agent's experiences contribute equally to training… ▽ More

    Submitted 8 November, 2021; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: Pacific Rim International Conference on Artificial Intelligence, 2021

  7. arXiv:2108.01867  [pdf, other

    cs.LG cs.NE stat.ML

    A Pragmatic Look at Deep Imitation Learning

    Authors: Kai Arulkumaran, Dan Ogawa Lillrank

    Abstract: The introduction of the generative adversarial imitation learning (GAIL) algorithm has spurred the development of scalable imitation learning approaches using deep neural networks. Many of the algorithms that followed used a similar procedure, combining on-policy actor-critic algorithms with inverse reinforcement learning. More recently there have been an even larger breadth of approaches, most of… ▽ More

    Submitted 19 September, 2023; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Asian Conference on Machine Learning, 2023

  8. arXiv:2005.09220  [pdf, other

    cs.LG cs.AI stat.ML

    Privileged Information Dropout in Reinforcement Learning

    Authors: Pierre-Alexandre Kamienny, Kai Arulkumaran, Feryal Behbahani, Wendelin Boehmer, Shimon Whiteson

    Abstract: Using privileged information during training can improve the sample efficiency and performance of machine learning systems. This paradigm has been applied to reinforcement learning (RL), primarily in the form of distillation or auxiliary tasks, and less commonly in the form of augmenting the inputs of agents. In this work, we investigate Privileged Information Dropout (\pid) for achieving the latt… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  9. arXiv:1912.08324  [pdf, other

    cs.LG cs.CV cs.NE cs.RO

    Analysing Deep Reinforcement Learning Agents Trained with Domain Randomisation

    Authors: Tianhong Dai, Kai Arulkumaran, Tamara Gerbert, Samyakh Tukra, Feryal Behbahani, Anil Anthony Bharath

    Abstract: Deep reinforcement learning has the potential to train robots to perform complex tasks in the real world without requiring accurate models of the robot or its environment. A practical approach is to train agents in simulation, and then transfer them to the real world. One popular method for achieving transferability is to use domain randomisation, which involves randomly perturbing various aspects… ▽ More

    Submitted 17 February, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

  10. arXiv:1911.09615  [pdf, other

    cs.LG cs.NE stat.ML

    Sample-Efficient Reinforcement Learning with Maximum Entropy Mellowmax Episodic Control

    Authors: Marta Sarrico, Kai Arulkumaran, Andrea Agostinelli, Pierre Richemond, Anil Anthony Bharath

    Abstract: Deep networks have enabled reinforcement learning to scale to more complex and challenging domains, but these methods typically require large quantities of training data. An alternative is to use sample-efficient episodic control methods: neuro-inspired algorithms which use non-/semi-parametric models that predict values based on storing and retrieving previously experienced transitions. One way t… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: Workshop on Biological and Artificial Reinforcement Learning, NeurIPS 2019

  11. arXiv:1911.09560  [pdf, other

    cs.LG cs.NE stat.ML

    Memory-Efficient Episodic Control Reinforcement Learning with Dynamic Online k-means

    Authors: Andrea Agostinelli, Kai Arulkumaran, Marta Sarrico, Pierre Richemond, Anil Anthony Bharath

    Abstract: Recently, neuro-inspired episodic control (EC) methods have been developed to overcome the data-inefficiency of standard deep reinforcement learning approaches. Using non-/semi-parametric models to estimate the value function, they learn rapidly, retrieving cached values from similar past states. In realistic scenarios, with limited resources and noisy data, maintaining meaningful representations… ▽ More

    Submitted 21 November, 2019; originally announced November 2019.

    Comments: Workshop on Biological and Artificial Reinforcement Learning, NeurIPS 2019

  12. arXiv:1902.01724  [pdf, ps, other

    cs.NE cs.AI cs.LG

    AlphaStar: An Evolutionary Computation Perspective

    Authors: Kai Arulkumaran, Antoine Cully, Julian Togelius

    Abstract: In January 2019, DeepMind revealed AlphaStar to the world-the first artificial intelligence (AI) system to beat a professional player at the game of StarCraft II-representing a milestone in the progress of AI. AlphaStar draws on many areas of AI research, including deep learning, reinforcement learning, game theory, and evolutionary computation (EC). In this paper we analyze AlphaStar primarily th… ▽ More

    Submitted 14 July, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

    Comments: Genetic and EvolutionaryComputation Conference Companion 2019

  13. arXiv:1807.06699  [pdf, other

    cs.NE cs.CV cs.LG stat.ML

    Adaptive Neural Trees

    Authors: Ryutaro Tanno, Kai Arulkumaran, Daniel C. Alexander, Antonio Criminisi, Aditya Nori

    Abstract: Deep neural networks and decision trees operate on largely separate paradigms; typically, the former performs representation learning with pre-specified architectures, while the latter is characterised by learning hierarchies over pre-specified features with data-driven architectures. We unite the two via adaptive neural trees (ANTs) that incorporates representation learning into edges, routing fu… ▽ More

    Submitted 9 June, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: International Conference on Machine Learning 2019

  14. arXiv:1805.09281  [pdf, other

    stat.ML cs.LG

    Variational Inference for Data-Efficient Model Learning in POMDPs

    Authors: Sebastian Tschiatschek, Kai Arulkumaran, Jan Stühmer, Katja Hofmann

    Abstract: Partially observable Markov decision processes (POMDPs) are a powerful abstraction for tasks that require decision making under uncertainty, and capture a wide range of real world tasks. Today, effective planning approaches exist that generate effective strategies given black-box models of a POMDP task. Yet, an open question is how to acquire accurate models for complex domains. In this paper we p… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

  15. Generative Adversarial Networks: An Overview

    Authors: Antonia Creswell, Tom White, Vincent Dumoulin, Kai Arulkumaran, Biswa Sengupta, Anil A Bharath

    Abstract: Generative adversarial networks (GANs) provide a way to learn deep representations without extensively annotated training data. They achieve this through deriving backpropagation signals through a competitive process involving a pair of networks. The representations that can be learned by GANs may be used in a variety of applications, including image synthesis, semantic image editing, style transf… ▽ More

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: Accepted in the IEEE Signal Processing Magazine Special Issue on Deep Learning for Visual Understanding

  16. arXiv:1708.08487  [pdf, other

    cs.CV cs.LG stat.ML

    On denoising autoencoders trained to minimise binary cross-entropy

    Authors: Antonia Creswell, Kai Arulkumaran, Anil A. Bharath

    Abstract: Denoising autoencoders (DAEs) are powerful deep learning models used for feature extraction, data generation and network pre-training. DAEs consist of an encoder and decoder which may be trained simultaneously to minimise a loss (function) between an input and the reconstruction of a corrupted version of the input. There are two common loss functions used for training autoencoders, these include t… ▽ More

    Submitted 9 October, 2017; v1 submitted 28 August, 2017; originally announced August 2017.

    Comments: Submitted to Pattern Recognition Letters

  17. arXiv:1708.05866  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Brief Survey of Deep Reinforcement Learning

    Authors: Kai Arulkumaran, Marc Peter Deisenroth, Miles Brundage, Anil Anthony Bharath

    Abstract: Deep reinforcement learning is poised to revolutionise the field of AI and represents a step towards building autonomous systems with a higher level understanding of the visual world. Currently, deep learning is enabling reinforcement learning to scale to problems that were previously intractable, such as learning to play video games directly from pixels. Deep reinforcement learning algorithms are… ▽ More

    Submitted 28 September, 2017; v1 submitted 19 August, 2017; originally announced August 2017.

    Comments: IEEE Signal Processing Magazine, Special Issue on Deep Learning for Image Understanding (arXiv extended version)

  18. arXiv:1611.02648  [pdf, other

    cs.LG cs.NE stat.ML

    Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders

    Authors: Nat Dilokthanakul, Pedro A. M. Mediano, Marta Garnelo, Matthew C. H. Lee, Hugh Salimbeni, Kai Arulkumaran, Murray Shanahan

    Abstract: We study a variant of the variational autoencoder model (VAE) with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the known problem of over-regularisation that has been shown to arise in regular VAEs also manifests itself in our model and leads to cluster degeneracy. We show that a heuristic called min… ▽ More

    Submitted 13 January, 2017; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: 12 pages, 6 figures, Under review as a conference paper at ICLR 2017

  19. arXiv:1610.09296  [pdf, other

    cs.LG cs.AI stat.ML

    Improving Sampling from Generative Autoencoders with Markov Chains

    Authors: Antonia Creswell, Kai Arulkumaran, Anil Anthony Bharath

    Abstract: We focus on generative autoencoders, such as variational or adversarial autoencoders, which jointly learn a generative model alongside an inference model. Generative autoencoders are those which are trained to softly enforce a prior on the latent distribution learned by the inference model. We call the distribution to which the inference model maps observed samples, the learned latent distribution… ▽ More

    Submitted 12 January, 2017; v1 submitted 28 October, 2016; originally announced October 2016.

  20. arXiv:1609.05518  [pdf, other

    cs.AI cs.LG

    Towards Deep Symbolic Reinforcement Learning

    Authors: Marta Garnelo, Kai Arulkumaran, Murray Shanahan

    Abstract: Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require very l… ▽ More

    Submitted 1 October, 2016; v1 submitted 18 September, 2016; originally announced September 2016.

  21. arXiv:1604.08153  [pdf, other

    cs.LG cs.AI stat.ML

    Classifying Options for Deep Reinforcement Learning

    Authors: Kai Arulkumaran, Nat Dilokthanakul, Murray Shanahan, Anil Anthony Bharath

    Abstract: In this paper we combine one method for hierarchical reinforcement learning - the options framework - with deep Q-networks (DQNs) through the use of different "option heads" on the policy network, and a supervisory network for choosing between the different options. We utilise our setup to investigate the effects of architectural constraints in subtasks with positive and negative transfer, across… ▽ More

    Submitted 19 June, 2017; v1 submitted 27 April, 2016; originally announced April 2016.

    Comments: IJCAI 2016 Workshop on Deep Reinforcement Learning: Frontiers and Challenges