Showing 1–9 of 9 results for author: Zhong, B

Searching in archive q-bio.
  1. arXiv:2406.19755

    q-bio.QM cs.AI

    Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?

    Authors: Yang Tan, Lirong Zheng, Bozitao Zhong, Liang Hong, Bingxin Zhou

    Abstract: Deep learning has become a crucial tool in studying proteins. While the significance of modeling protein structure has been discussed extensively in the literature, amino acid types are typically included in the input as a default operation for many inference tasks. This study demonstrates with a structure alignment task that embedding amino acid types in some cases may not help a deep learning mode…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

  2. arXiv:2404.14850

    cs.CL cs.LG q-bio.BM

    Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models

    Authors: Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

    Abstract: Fine-tuning pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. Parameter-Efficient Fine-Tuning, a widely applied and powerful technique in natural language processing, could potentially enhance the performance of PLMs. However, the direct transfe…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures, 8 tables

  3. arXiv:2402.10433

    q-bio.BM cs.LG q-bio.QM

    Fusing Neural and Physical: Augment Protein Conformation Sampling with Tractable Simulations

    Authors: Jiarui Lu, Zuobai Zhang, Bozitao Zhong, Chence Shi, Jian Tang

    Abstract: Protein dynamics are common and important for biological functions and properties, and their study usually involves time-consuming molecular dynamics (MD) simulations in silico. Recently, generative models have been leveraged as surrogate samplers to obtain conformation ensembles orders of magnitude faster and without requiring any simulation data (a "zero-shot" inference). Howev…

    Submitted 11 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Published at the GEM workshop, ICLR 2024

  4. arXiv:2312.00080

    q-bio.QM cs.LG

    PDB-Struct: A Comprehensive Benchmark for Structure-based Protein Design

    Authors: Chuanrui Wang, Bozitao Zhong, Zuobai Zhang, Narendra Chaudhary, Sanchit Misra, Jian Tang

    Abstract: Structure-based protein design has attracted increasing interest, with numerous methods being introduced in recent years. However, a universally accepted method for evaluation has not been established, since wet-lab validation can be overly time-consuming for the development of new algorithms, and in silico validation with recovery and perplexity metrics is efficient but may not…

    Submitted 29 November, 2023; originally announced December 2023.

    Comments: 13 pages

  5. arXiv:2306.03117

    q-bio.QM cs.LG q-bio.BM

    Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling

    Authors: Jiarui Lu, Bozitao Zhong, Zuobai Zhang, Jian Tang

    Abstract: The dynamic nature of proteins is crucial for determining their biological functions and properties, for which Monte Carlo (MC) and molecular dynamics (MD) simulations stand as predominant tools to study such phenomena. By utilizing empirically derived force fields, MC or MD simulations explore the conformational space through numerically evolving the system via Markov chain or Newtonian mechanics…

    Submitted 11 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: Published as a conference paper at ICLR 2024, see https://openreview.net/forum?id=C4BikKsgmK

  6. arXiv:2306.01794

    q-bio.QM cs.LG

    DiffPack: A Torsional Diffusion Model for Autoregressive Protein Side-Chain Packing

    Authors: Yangtian Zhang, Zuobai Zhang, Bozitao Zhong, Sanchit Misra, Jian Tang

    Abstract: Proteins play a critical role in carrying out biological functions, and their 3D structures are essential in determining their functions. Accurately predicting the conformation of protein side-chains given their backbones is important for applications in protein structure prediction, design and protein-protein interactions. Traditional methods are computationally intensive and have limited accurac…

    Submitted 15 February, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  7. arXiv:2210.08761

    q-bio.BM cs.LG

    Protein Sequence and Structure Co-Design with Equivariant Translation

    Authors: Chence Shi, Chuanrui Wang, Jiarui Lu, Bozitao Zhong, Jian Tang

    Abstract: Proteins are macromolecules that perform essential functions in all living organisms. Designing novel proteins with specific structures and desired functions has been a long-standing challenge in the field of bioengineering. Existing approaches generate both protein sequence and structure using either autoregressive models or diffusion models, both of which suffer from high inference costs. In thi…

    Submitted 2 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

    Comments: Published as a conference paper at ICLR 2023, see https://openreview.net/forum?id=pRCMXcfdihq

  8. arXiv:2210.06069

    q-bio.BM cs.LG

    E3Bind: An End-to-End Equivariant Network for Protein-Ligand Docking

    Authors: Yangtian Zhang, Huiyu Cai, Chence Shi, Bozitao Zhong, Jian Tang

    Abstract: In silico prediction of the ligand binding pose to a given protein target is a crucial but challenging task in drug discovery. This work focuses on blind flexible self-docking, where we aim to predict the positions, orientations and conformations of docked molecules. Traditional physics-based methods usually suffer from inaccurate scoring functions and high inference cost. Recently, data-driven met…

    Submitted 1 June, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: International Conference on Learning Representations (ICLR 2023)

  9. arXiv:2111.06340

    q-bio.BM

    ParaFold: Paralleling AlphaFold for Large-Scale Predictions

    Authors: Bozitao Zhong, Xiaoming Su, Minhua Wen, Sichen Zuo, Liang Hong, James Lin

    Abstract: AlphaFold predicts protein structures from the amino acid sequence at or near experimental resolution, solving the 50-year-old protein folding challenge and enabling progress by transforming large-scale genomics data into protein structures. AlphaFold will also greatly change the scientific research model from a low-throughput to a high-throughput manner. The AlphaFold framework is a mixture of two typ…

    Submitted 13 November, 2021; v1 submitted 11 November, 2021; originally announced November 2021.