Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–10 of 10 results for author: Anandkumar, A

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2406.15669  [pdf, other

    q-bio.BM

    CARE: a Benchmark Suite for the Classification and Retrieval of Enzymes

    Authors: Jason Yang, Ariane Mora, Shengchao Liu, Bruce J. Wittmann, Anima Anandkumar, Frances H. Arnold, Yisong Yue

    Abstract: Enzymes are important proteins that catalyze chemical reactions. In recent years, machine learning methods have emerged to predict enzyme function from sequence; however, there are no standardized benchmarks to evaluate these methods. We introduce CARE, a benchmark and dataset suite for the Classification And Retrieval of Enzymes (CARE). CARE centers on two tasks: (1) classification of a protein s… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  2. arXiv:2401.15122  [pdf, other

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics

    Authors: Shengchao Liu, Weitao Du, Yanjing Li, Zhuoxinran Li, Vignesh Bhethanabotla, Nakul Rampal, Omar Yaghi, Christian Borgs, Anima Anandkumar, Hongyu Guo, Jennifer Chayes

    Abstract: In drug discovery, molecular dynamics (MD) simulation for protein-ligand binding provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites. There has been a long history of improving the efficiency of MD simulations through better numerical methods and, more recently, by utilizing machine learning (ML) methods. Yet, challenges remain, s… ▽ More

    Submitted 1 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  3. arXiv:2401.11037  [pdf, other

    cs.LG math.NA q-bio.QM

    Equivariant Graph Neural Operator for Modeling 3D Dynamics

    Authors: Minkai Xu, Jiaqi Han, Aaron Lou, Jean Kossaifi, Arvind Ramanathan, Kamyar Azizzadenesheli, Jure Leskovec, Stefano Ermon, Anima Anandkumar

    Abstract: Modeling the complex three-dimensional (3D) dynamics of relational systems is an important problem in the natural sciences, with applications ranging from molecular simulations to particle mechanics. Machine learning methods have achieved good success by learning graph neural networks to model spatial interactions. However, these approaches do not faithfully capture temporal correlations since the… ▽ More

    Submitted 2 June, 2024; v1 submitted 19 January, 2024; originally announced January 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024. Copyright 2024 by the author(s)

  4. arXiv:2306.09375  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Symmetry-Informed Geometric Representation for Molecules, Proteins, and Crystalline Materials

    Authors: Shengchao Liu, Weitao Du, Yanjing Li, Zhuoxinran Li, Zhiling Zheng, Chenru Duan, Zhiming Ma, Omar Yaghi, Anima Anandkumar, Christian Borgs, Jennifer Chayes, Hongyu Guo, Jian Tang

    Abstract: Artificial intelligence for scientific discovery has recently generated significant interest within the machine learning and scientific communities, particularly in the domains of chemistry, biology, and material discovery. For these scientific problems, molecules serve as the fundamental building blocks, and machine learning has emerged as a highly effective and powerful tool for modeling their g… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  5. arXiv:2302.04611  [pdf, other

    cs.LG cs.AI q-bio.QM stat.ML

    A Text-guided Protein Design Framework

    Authors: Shengchao Liu, Yanjing Li, Zhuoxinran Li, Anthony Gitter, Yutao Zhu, Jiarui Lu, Zhao Xu, Weili Nie, Arvind Ramanathan, Chaowei Xiao, Jian Tang, Hongyu Guo, Anima Anandkumar

    Abstract: Current AI-assisted protein design mainly utilizes protein sequential and structural information. Meanwhile, there exists tremendous knowledge curated by humans in the text format describing proteins' high-level functionalities. Yet, whether the incorporation of such text data can help protein design tasks has not been explored. To bridge this gap, we propose ProteinDT, a multi-modal framework tha… ▽ More

    Submitted 3 December, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

  6. arXiv:2212.10789  [pdf, other

    cs.LG cs.CL q-bio.QM stat.ML

    Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

    Authors: Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao, Ling Liu, Jian Tang, Chaowei Xiao, Anima Anandkumar

    Abstract: There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast textual knowledge available in chemistry. Incorporating textual knowledge enables us to realize new drug design objectives, adapt to text-based instructions and predict complex biological activities. Her… ▽ More

    Submitted 29 January, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

  7. arXiv:2209.15171  [pdf, other

    q-bio.QM cs.LG q-bio.BM

    State-specific protein-ligand complex structure prediction with a multi-scale deep generative model

    Authors: Zhuoran Qiao, Weili Nie, Arash Vahdat, Thomas F. Miller III, Anima Anandkumar

    Abstract: The binding complexes formed by proteins and small molecule ligands are ubiquitous and critical to life. Despite recent advancements in protein structure prediction, existing algorithms are so far unable to systematically predict the binding ligand structures along with their regulatory effects on protein folding. To address this discrepancy, we present NeuralPLexer, a computational approach that… ▽ More

    Submitted 19 April, 2023; v1 submitted 29 September, 2022; originally announced September 2022.

    Comments: 19 pages, 5 figures, 1 table & Supplementary Information (18 pages, 2 figures, 7 tables, 12 algorithms); supersedes an earlier version arXiv:2209.15171v1 presented at the NeurIPS 2022 MLSB workshop as a contributed talk

  8. arXiv:2208.11126  [pdf, other

    q-bio.QM cs.LG

    Retrieval-based Controllable Molecule Generation

    Authors: Zichao Wang, Weili Nie, Zhuoran Qiao, Chaowei Xiao, Richard Baraniuk, Anima Anandkumar

    Abstract: Generating new molecules with specified chemical and biological properties via generative models has emerged as a promising direction for drug discovery. However, existing methods require extensive training/fine-tuning with a large dataset, often unavailable in real-world generation tasks. In this work, we propose a new retrieval-based framework for controllable molecule generation. We use a small… ▽ More

    Submitted 24 April, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: ICLR 2023

  9. arXiv:1609.06335  [pdf, other

    q-bio.MN cs.LG

    Unsupervised learning of transcriptional regulatory networks via latent tree graphical models

    Authors: Anthony Gitter, Furong Huang, Ragupathyraj Valluvan, Ernest Fraenkel, Animashree Anandkumar

    Abstract: Gene expression is a readily-observed quantification of transcriptional activity and cellular state that enables the recovery of the relationships between regulators and their target genes. Reconstructing transcriptional regulatory networks from gene expression data is a problem that has attracted much attention, but previous work often makes the simplifying (but unrealistic) assumption that regul… ▽ More

    Submitted 20 September, 2016; originally announced September 2016.

    Comments: 37 pages, 9 figures

  10. arXiv:1602.01889  [pdf, ps, other

    q-bio.NC stat.ML

    Discovering Neuronal Cell Types and Their Gene Expression Profiles Using a Spatial Point Process Mixture Model

    Authors: Furong Huang, Animashree Anandkumar, Christian Borgs, Jennifer Chayes, Ernest Fraenkel, Michael Hawrylycz, Ed Lein, Alessandro Ingrosso, Srinivas Turaga

    Abstract: Cataloging the neuronal cell types that comprise circuitry of individual brain regions is a major goal of modern neuroscience and the BRAIN initiative. Single-cell RNA sequencing can now be used to measure the gene expression profiles of individual neurons and to categorize neurons based on their gene expression profiles. While the single-cell techniques are extremely powerful and hold great promi… ▽ More

    Submitted 10 June, 2016; v1 submitted 4 February, 2016; originally announced February 2016.