Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–6 of 6 results for author: AlQuraishi, M

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2405.15489  [pdf, other

    q-bio.BM cs.LG

    Out of Many, One: Designing and Scaffolding Proteins at the Scale of the Structural Universe with Genie 2

    Authors: Yeqing Lin, Minji Lee, Zhao Zhang, Mohammed AlQuraishi

    Abstract: Protein diffusion models have emerged as a promising approach for protein design. One such pioneering model is Genie, a method that asymmetrically represents protein structures during the forward and backward processes, using simple Gaussian noising for the former and expressive SE(3)-equivariant attention for the latter. In this work we introduce Genie 2, extending Genie to capture a larger and m… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  2. arXiv:2310.06725  [pdf, other

    q-bio.BM cs.LG

    Growing ecosystem of deep learning methods for modeling protein$\unicode{x2013}$protein interactions

    Authors: Julia R. Rogers, Gergő Nikolényi, Mohammed AlQuraishi

    Abstract: Numerous cellular functions rely on protein$\unicode{x2013}$protein interactions. Efforts to comprehensively characterize them remain challenged however by the diversity of molecular recognition mechanisms employed within the proteome. Deep learning has emerged as a promising approach for tackling this problem by exploiting both experimental data and basic biophysical knowledge about protein inter… ▽ More

    Submitted 6 December, 2023; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 19 pages, added model names to discussion

  3. arXiv:2308.05326  [pdf, other

    q-bio.BM cs.LG

    OpenProteinSet: Training data for structural biology at scale

    Authors: Gustaf Ahdritz, Nazim Bouatta, Sachin Kadyan, Lukas Jarosch, Daniel Berenberg, Ian Fisk, Andrew M. Watkins, Stephen Ra, Richard Bonneau, Mohammed AlQuraishi

    Abstract: Multiple sequence alignments (MSAs) of proteins encode rich biological information and have been workhorses in bioinformatic methods for tasks like protein design and protein structure prediction for decades. Recent breakthroughs like AlphaFold2 that use transformers to attend directly over large quantities of raw MSAs have reaffirmed their importance. Generation of MSAs is highly computationally… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  4. arXiv:2301.12485  [pdf, other

    q-bio.BM cs.LG

    Generating Novel, Designable, and Diverse Protein Structures by Equivariantly Diffusing Oriented Residue Clouds

    Authors: Yeqing Lin, Mohammed AlQuraishi

    Abstract: Proteins power a vast array of functional processes in living cells. The capability to create new proteins with designed structures and functions would thus enable the engineering of cellular behavior and development of protein-based therapeutics and materials. Structure-based protein design aims to find structures that are designable (can be realized by a protein sequence), novel (have dissimilar… ▽ More

    Submitted 6 June, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

  5. arXiv:1911.05531  [pdf, other

    q-bio.BM cs.LG stat.ML

    Accurate Protein Structure Prediction by Embeddings and Deep Learning Representations

    Authors: Iddo Drori, Darshan Thaker, Arjun Srivatsa, Daniel Jeong, Yueqi Wang, Linyong Nan, Fan Wu, Dimitri Leggas, Jinhao Lei, Weiyi Lu, Weilong Fu, Yuan Gao, Sashank Karri, Anand Kannan, Antonio Moretti, Mohammed AlQuraishi, Chen Keasar, Itsik Pe'er

    Abstract: Proteins are the major building blocks of life, and actuators of almost all chemical and biophysical events in living organisms. Their native structures in turn enable their biological functions which have a fundamental role in drug design. This motivates predicting the structure of a protein from its sequence of amino acids, a fundamental problem in computational biology. In this work, we demonst… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Journal ref: Machine Learning in Computational Biology, 2019

  6. arXiv:1902.00249  [pdf

    q-bio.BM cs.LG q-bio.QM stat.ML

    ProteinNet: a standardized data set for machine learning of protein structure

    Authors: Mohammed AlQuraishi

    Abstract: Rapid progress in deep learning has spurred its application to bioinformatics problems including protein structure prediction and design. In classic machine learning problems like computer vision, progress has been driven by standardized data sets that facilitate fair assessment of new methods and lower the barrier to entry for non-domain experts. While data sets of protein sequence and structure… ▽ More

    Submitted 1 February, 2019; originally announced February 2019.

    Comments: 8 pages, 6 figures, 1 table