Showing 1–7 of 7 results for author: Yim, J

Searching in archive q-bio.
  1. arXiv:2402.04997  [pdf, other]

    stat.ML cs.LG q-bio.QM

    Generative Flows on Discrete State-Spaces: Enabling Multimodal Flows with Applications to Protein Co-Design

    Authors: Andrew Campbell, Jason Yim, Regina Barzilay, Tom Rainforth, Tommi Jaakkola

    Abstract: Combining discrete and continuous data is an important capability for generative models. We present Discrete Flow Models (DFMs), a new flow-based model of discrete data that provides the missing link in enabling flow-based generative models to be applied to multimodal continuous and discrete data problems. Our key insight is that the discrete equivalent of continuous space flow matching can be rea…

    Submitted 5 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: 60 pages, 11 figures, 6 tables; ICML 2024

  2. arXiv:2401.04082  [pdf, other]

    q-bio.QM cs.LG stat.ML

    Improved motif-scaffolding with SE(3) flow matching

    Authors: Jason Yim, Andrew Campbell, Emile Mathieu, Andrew Y. K. Foong, Michael Gastegger, José Jiménez-Luna, Sarah Lewis, Victor Garcia Satorras, Bastiaan S. Veeling, Frank Noé, Regina Barzilay, Tommi S. Jaakkola

    Abstract: Protein design often begins with the knowledge of a desired function from a motif which motif-scaffolding aims to construct a functional protein around. Recently, generative models have achieved breakthrough success in designing scaffolds for a range of motifs. However, generated scaffolds tend to lack structural diversity, which can hinder success in wet-lab validation. In this work, we extend Fr…

    Submitted 18 July, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Preprint. Code: https://github.com/microsoft/frame-flow

    Journal ref: Transactions on Machine Learning Research 2024

  3. arXiv:2312.02447  [pdf, other]

    q-bio.BM stat.ML

    Fast non-autoregressive inverse folding with discrete diffusion

    Authors: John J. Yang, Jason Yim, Regina Barzilay, Tommi Jaakkola

    Abstract: Generating protein sequences that fold into an intended 3D structure is a fundamental step in de novo protein design. De facto methods utilize autoregressive generation, but this eschews higher order interactions that could be exploited to improve inference speed. We describe a non-autoregressive alternative that performs inference using a constant number of calls resulting in a 23 times speed up w…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: NeurIPS Machine Learning for Structural Biology workshop

  4. arXiv:2310.05297  [pdf, other]

    q-bio.QM

    Fast protein backbone generation with SE(3) flow matching

    Authors: Jason Yim, Andrew Campbell, Andrew Y. K. Foong, Michael Gastegger, José Jiménez-Luna, Sarah Lewis, Victor Garcia Satorras, Bastiaan S. Veeling, Regina Barzilay, Tommi Jaakkola, Frank Noé

    Abstract: We present FrameFlow, a method for fast protein backbone generation using SE(3) flow matching. Specifically, we adapt FrameDiff, a state-of-the-art diffusion model, to the flow-matching generative modeling paradigm. We show how flow matching can be applied on SE(3) and propose modifications during training to effectively learn the vector field. Compared to FrameDiff, FrameFlow requires five times…

    Submitted 10 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: Preprint

  5. arXiv:2307.00494  [pdf, other]

    q-bio.BM cs.LG q-bio.QM stat.ML

    Improving Protein Optimization with Smoothed Fitness Landscapes

    Authors: Andrew Kirjner, Jason Yim, Raman Samusevich, Shahar Bracha, Tommi Jaakkola, Regina Barzilay, Ila Fiete

    Abstract: The ability to engineer novel proteins with higher fitness for a desired property would be revolutionary for biotechnology and medicine. Modeling the combinatorially large space of sequences is infeasible; prior methods often constrain optimization to a small mutational radius, but this drastically limits the design space. Instead of heuristics, we propose smoothing the fitness landscape to facili…

    Submitted 2 March, 2024; v1 submitted 2 July, 2023; originally announced July 2023.

    Comments: ICLR 2024. Code: https://github.com/kirjner/GGS

  6. arXiv:2302.02277  [pdf, other]

    cs.LG q-bio.QM stat.ML

    SE(3) diffusion model with application to protein backbone generation

    Authors: Jason Yim, Brian L. Trippe, Valentin De Bortoli, Emile Mathieu, Arnaud Doucet, Regina Barzilay, Tommi Jaakkola

    Abstract: The design of novel protein structures remains a challenge in protein engineering for applications across biomedicine and chemistry. In this line of work, a diffusion model over rigid bodies in 3D (referred to as frames) has shown success in generating novel, functional protein backbones that have not been observed in nature. However, there exists no principled methodological framework for diffusi…

    Submitted 22 May, 2023; v1 submitted 4 February, 2023; originally announced February 2023.

    Journal ref: International Conference on Machine Learning (ICML) 2023

  7. arXiv:2206.04119  [pdf, other]

    q-bio.BM cs.LG stat.ML

    Diffusion probabilistic modeling of protein backbones in 3D for the motif-scaffolding problem

    Authors: Brian L. Trippe, Jason Yim, Doug Tischer, David Baker, Tamara Broderick, Regina Barzilay, Tommi Jaakkola

    Abstract: Construction of a scaffold structure that supports a desired motif, conferring protein function, shows promise for the design of vaccines and enzymes. But a general solution to this motif-scaffolding problem remains open. Current machine-learning techniques for scaffold design are either limited to unrealistically small scaffolds (up to length 20) or struggle to produce multiple diverse scaffolds.…

    Submitted 19 March, 2023; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Appearing in ICLR 2023. Code available: github.com/blt2114/ProtDiff_SMCDiff