Showing 1–5 of 5 results for author: Sujit, S

Searching in archive cs.
  1. arXiv:2407.03154  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Reinforcement Learning for Sequence Design Leveraging Protein Language Models

    Authors: Jithendaraa Subramanian, Shivakanth Sujit, Niloy Irtisam, Umong Sain, Derek Nowrouzezahrai, Samira Ebrahimi Kahou, Riashat Islam

    Abstract: Protein sequence design, determined by amino acid sequences, are essential to protein engineering problems in drug discovery. Prior approaches have resorted to evolutionary strategies or Monte-Carlo methods for protein design, but often fail to exploit the structure of the combinatorial search space, to generalize to unseen sequences. In the context of discrete black box optimization over large se…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 22 pages, 7 figures, 4 tables

  2. arXiv:2311.03534  [pdf, other]

    cs.LG cs.AI cs.RO

    PcLast: Discovering Plannable Continuous Latent States

    Authors: Anurag Koul, Shivakanth Sujit, Shaoru Chen, Ben Evans, Lili Wu, Byron Xu, Rajan Chari, Riashat Islam, Raihan Seraj, Yonathan Efroni, Lekan Molu, Miro Dudik, John Langford, Alex Lamb

    Abstract: Goal-conditioned planning benefits from learned low-dimensional representations of rich observations. While compact latent representations typically learned from variational autoencoders or inverse dynamics enable goal-conditioned decision making, they ignore state reachability, hampering their performance. In this paper, we learn a representation that associates reachable states together for effe…

    Submitted 10 June, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at ICML 2024

  3. arXiv:2212.08131  [pdf, other]

    cs.LG

    Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies

    Authors: Shivakanth Sujit, Pedro H. M. Braga, Jorg Bornschein, Samira Ebrahimi Kahou

    Abstract: Reinforcement learning (RL) has shown great promise with algorithms learning in environments with large state and action spaces purely from scalar reward signals. A crucial challenge for current deep RL algorithms is that they require a tremendous amount of environment interactions for learning. This can be infeasible in situations where such interactions are expensive; such as in robotics. Offlin…

    Submitted 21 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: TMLR 2023

  4. arXiv:2210.11698  [pdf, other]

    cs.LG cs.AI

    Learning Robust Dynamics through Variational Sparse Gating

    Authors: Arnav Kumar Jain, Shivakanth Sujit, Shruti Joshi, Vincent Michalski, Danijar Hafner, Samira Ebrahimi-Kahou

    Abstract: Learning world models from their sensory inputs enables agents to plan for actions by imagining their future outcomes. World models have previously been shown to improve sample-efficiency in simulated environments with few objects, but have not yet been applied successfully to environments with many objects. In environments with many objects, often only a small number of them are moving or interac…

    Submitted 20 October, 2022; originally announced October 2022.

  5. arXiv:2208.10483  [pdf, other]

    cs.LG cs.AI

    Prioritizing Samples in Reinforcement Learning with Reducible Loss

    Authors: Shivakanth Sujit, Somjit Nath, Pedro H. M. Braga, Samira Ebrahimi Kahou

    Abstract: Most reinforcement learning algorithms take advantage of an experience replay buffer to repeatedly train on samples the agent has observed in the past. Not all samples carry the same amount of significance and simply assigning equal importance to each of the samples is a naïve strategy. In this paper, we propose a method to prioritize samples based on how much we can learn from a sample. We define…

    Submitted 1 November, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2023