-
MoFormer: Multi-objective Antimicrobial Peptide Generation Based on Conditional Transformer Joint Multi-modal Fusion Descriptor
Authors:
Li Wang,
Xiangzheng Fu,
Jiahao Yang,
Xinyi Zhang,
Xiucai Ye,
Yiping Liu,
Tetsuya Sakurai,
Xiangxiang Zeng
Abstract:
Deep learning holds a big promise for optimizing existing peptides with more desirable properties, a critical step towards accelerating new drug discovery. Despite the recent emergence of several optimized Antimicrobial peptides(AMP) generation methods, multi-objective optimizations remain still quite challenging for the idealism-realism tradeoff. Here, we establish a multi-objective AMP synthesis…
▽ More
Deep learning holds a big promise for optimizing existing peptides with more desirable properties, a critical step towards accelerating new drug discovery. Despite the recent emergence of several optimized Antimicrobial peptides(AMP) generation methods, multi-objective optimizations remain still quite challenging for the idealism-realism tradeoff. Here, we establish a multi-objective AMP synthesis pipeline (MoFormer) for the simultaneous optimization of multi-attributes of AMPs. MoFormer improves the desired attributes of AMP sequences in a highly structured latent space, guided by conditional constraints and fine-grained multi-descriptor.We show that MoFormer outperforms existing methods in the generation task of enhanced antimicrobial activity and minimal hemolysis. We also utilize a Pareto-based non-dominated sorting algorithm and proxies based on large model fine-tuning to hierarchically rank the candidates. We demonstrate substantial property improvement using MoFormer from two perspectives: (1) employing molecular simulations and scoring interactions among amino acids to decipher the structure and functionality of AMPs; (2) visualizing latent space to examine the qualities and distribution features, verifying an effective means to facilitate multi-objective optimization AMPs with design constraints
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Unveiling interpretable development-specific gene signatures in the developing human prefrontal cortex with ICGS
Authors:
Meng Huang,
Xiucai Ye,
Tetsuya Sakurai
Abstract:
In this paper, to unveil interpretable development-specific gene signatures in human PFC, we propose a novel gene selection method, named Interpretable Causality Gene Selection (ICGS), which adopts a Bayesian Network (BN) to represent causality between multiple gene variables and a development variable. The proposed ICGS method combines the positive instances-based contrastive learning with a Vari…
▽ More
In this paper, to unveil interpretable development-specific gene signatures in human PFC, we propose a novel gene selection method, named Interpretable Causality Gene Selection (ICGS), which adopts a Bayesian Network (BN) to represent causality between multiple gene variables and a development variable. The proposed ICGS method combines the positive instances-based contrastive learning with a Variational AutoEncoder (VAE) to obtain this optimal BN structure and use a Markov Blanket (MB) to identify gene signatures causally related to the development variable. Moreover, the differential expression genes (DEGs) are used to filter redundant genes before gene selection. In order to identify gene signatures, we apply the proposed ICGS to the human PFC single-cell transcriptomics data. The experimental results demonstrate that the proposed method can effectively identify interpretable development-specific gene signatures in human PFC. Gene ontology enrichment analysis and ASD-related gene analysis show that these identified gene signatures reveal the key biological processes and pathways in human PFC and have more potential for neurodevelopment disorder cure. These gene signatures are expected to bring important implications for understanding PFC development heterogeneity and function in humans.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Inferring cell-specific lncRNA regulation with single-cell RNA-sequencing data in the developing human neocortex
Authors:
Meng Huang,
Jiangtao Ma,
Changzhou Long,
Junpeng Zhang,
Xiucai Ye,
Tetsuya Sakurai
Abstract:
Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-s…
▽ More
Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-seq has provided a way to investigate lncRNA regulation at single-cell level. We will propose a novel computational method, CSlncR (cell-specific lncRNA regulation), which combines putative lncRNA-mRNA binding information with scRNA-seq data including lncRNAs and mRNAs to identify cell-specific lncRNA-mRNA regulation networks at individual cells. To understand lncRNA regulation at different development stages, we apply CSlncR to the scRNA-seq data of human neocortex. Network analysis shows that the lncRNA regulation is unique in each cell from the different human neocortex development stages. The comparison results indicate that CSlncR is also an effective tool for predicting cell-specific lncRNA targets and clustering single cells, which helps understand cell-cell communication.
△ Less
Submitted 29 November, 2022; v1 submitted 15 November, 2022;
originally announced November 2022.