Search | arXiv e-print repository

Revolutionizing MRI Data Processing Using FSL: Preliminary Findings with the Fugaku Supercomputer

Authors: Tianxiang Lyu, Wataru Uchida, Zhe Sun, Christina Andica, Keita Tokuda, Rui Zou, Jie Mao, Keigo Shimoji, Koji Kamagata, Mitsuhisa Sato, Ryutaro Himeno, Shigeki Aoki

Abstract: The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. In this preliminary study, we applied FMRIB Software Library commands on T1-weighted and diffusion-weighted images of a single young adult using the Fugaku supercomputer. The tensor-based measurements and… ▽ More The amount of Magnetic resonance imaging data has grown tremendously recently, creating an urgent need to accelerate data processing, which requires substantial computational resources and time. In this preliminary study, we applied FMRIB Software Library commands on T1-weighted and diffusion-weighted images of a single young adult using the Fugaku supercomputer. The tensor-based measurements and subcortical structure segmentations performed on Fugaku supercomputer were highly consistent with those from conventional systems, demonstrating its reliability and significantly reduced processing time. △ Less

Submitted 16 July, 2024; originally announced July 2024.

arXiv:2407.01649 [pdf, other]

FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

Abstract: Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl… ▽ More Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing problem on high-rotational-error targets. To address this fundamental limitation, we propose a novel geodesic loss called Frame Aligned Frame Error (FAFE, denoted as F2E to distinguish from FAPE), which enables the model to better optimize both the rotational and translational errors between two frames. We then prove that F2E can be reformulated as a group-aware geodesic loss, which translates the optimization of the residue-to-residue error to optimizing group-to-group geodesic frame distance. By fine-tuning AF2 with our proposed new loss function, we attain a correct rate of 52.3\% (DockQ $>$ 0.23) on an evaluation set and 43.8\% correct rate on a subset with low homology, with substantial improvement over AF2 by 182\% and 100\% respectively. △ Less

Submitted 1 July, 2024; originally announced July 2024.

arXiv:2406.04628 [pdf, other]

Projecting Molecules into Synthesizable Chemical Spaces

Authors: Shitong Luo, Wenhao Gao, Zuofan Wu, Jian Peng, Connor W. Coley, Jianzhu Ma

Abstract: Discovering new drug molecules is a pivotal yet challenging process due to the near-infinitely large chemical space and notorious demands on time and resources. Numerous generative models have recently been introduced to accelerate the drug discovery process, but their progression to experimental validation remains limited, largely due to a lack of consideration for synthetic accessibility in prac… ▽ More Discovering new drug molecules is a pivotal yet challenging process due to the near-infinitely large chemical space and notorious demands on time and resources. Numerous generative models have recently been introduced to accelerate the drug discovery process, but their progression to experimental validation remains limited, largely due to a lack of consideration for synthetic accessibility in practical settings. In this work, we introduce a novel framework that is capable of generating new chemical structures while ensuring synthetic accessibility. Specifically, we introduce a postfix notation of synthetic pathways to represent molecules in chemical space. Then, we design a transformer-based model to translate molecular graphs into postfix notations of synthesis. We highlight the model's ability to: (a) perform bottom-up synthesis planning more accurately, (b) generate structurally similar, synthesizable analogs for unsynthesizable molecules proposed by generative models with their properties preserved, and (c) explore the local synthesizable chemical space around hit molecules. △ Less

Submitted 7 June, 2024; originally announced June 2024.

arXiv:2406.00735 [pdf, other]

Full-Atom Peptide Design based on Multi-modal Flow Matching

Authors: Jiahan Li, Chaoran Cheng, Zuofan Wu, Ruihan Guo, Shitong Luo, Zhizhou Ren, Jian Peng, Jianzhu Ma

Abstract: Peptides, short chains of amino acid residues, play a vital role in numerous biological processes by interacting with other target molecules, offering substantial potential in drug discovery. In this work, we present PepFlow, the first multi-modal deep generative model grounded in the flow-matching framework for the design of full-atom peptides that target specific protein receptors. Drawing inspi… ▽ More Peptides, short chains of amino acid residues, play a vital role in numerous biological processes by interacting with other target molecules, offering substantial potential in drug discovery. In this work, we present PepFlow, the first multi-modal deep generative model grounded in the flow-matching framework for the design of full-atom peptides that target specific protein receptors. Drawing inspiration from the crucial roles of residue backbone orientations and side-chain dynamics in protein-peptide interactions, we characterize the peptide structure using rigid backbone frames within the $\mathrm{SE}(3)$ manifold and side-chain angles on high-dimensional tori. Furthermore, we represent discrete residue types in the peptide sequence as categorical distributions on the probability simplex. By learning the joint distributions of each modality using derived flows and vector fields on corresponding manifolds, our method excels in the fine-grained design of full-atom peptides. Harnessing the multi-modal paradigm, our approach adeptly tackles various tasks such as fix-backbone sequence design and side-chain packing through partial sampling. Through meticulously crafted experiments, we demonstrate that PepFlow exhibits superior performance in comprehensive benchmarks, highlighting its significant potential in computational peptide design and analysis. △ Less

Submitted 2 June, 2024; originally announced June 2024.

Comments: ICML 2024

arXiv:2403.17615 [pdf, other]

Grad-CAMO: Learning Interpretable Single-Cell Morphological Profiles from 3D Cell Painting Images

Authors: Vivek Gopalakrishnan, Jingzhe Ma, Zhiyong Xie

Abstract: Despite their black-box nature, deep learning models are extensively used in image-based drug discovery to extract feature vectors from single cells in microscopy images. To better understand how these networks perform representation learning, we employ visual explainability techniques (e.g., Grad-CAM). Our analyses reveal several mechanisms by which supervised models cheat, exploiting biologicall… ▽ More Despite their black-box nature, deep learning models are extensively used in image-based drug discovery to extract feature vectors from single cells in microscopy images. To better understand how these networks perform representation learning, we employ visual explainability techniques (e.g., Grad-CAM). Our analyses reveal several mechanisms by which supervised models cheat, exploiting biologically irrelevant pixels when extracting morphological features from images, such as noise in the background. This raises doubts regarding the fidelity of learned single-cell representations and their relevance when investigating downstream biological questions. To address this misalignment between researcher expectations and machine behavior, we introduce Grad-CAMO, a novel single-cell interpretability score for supervised feature extractors. Grad-CAMO measures the proportion of a model's attention that is concentrated on the cell of interest versus the background. This metric can be assessed per-cell or averaged across a validation set, offering a tool to audit individual features vectors or guide the improved design of deep learning architectures. Importantly, Grad-CAMO seamlessly integrates into existing workflows, requiring no dataset or model modifications, and is compatible with both 2D and 3D Cell Painting data. Additional results are available at https://github.com/eigenvivek/Grad-CAMO. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.17513 [pdf, other]

A unified framework for coarse grained molecular dynamics of proteins

Authors: Jinzhen Zhu, Jianpeng Ma

Abstract: Understanding protein dynamics is crucial for elucidating their biological functions. While all-atom molecular dynamics (MD) simulations provide detailed information, coarse-grained (CG) MD simulations capture the essential collective motions of proteins at significantly lower computational cost. In this article, we present a unified framework for coarse-grained molecular dynamics simulation of pr… ▽ More Understanding protein dynamics is crucial for elucidating their biological functions. While all-atom molecular dynamics (MD) simulations provide detailed information, coarse-grained (CG) MD simulations capture the essential collective motions of proteins at significantly lower computational cost. In this article, we present a unified framework for coarse-grained molecular dynamics simulation of proteins. Our approach utilizes a tree-structured representation of collective variables, enabling reconstruction of protein Cartesian coordinates with high fidelity. The evolution of configurations is constructed using a deep neural network trained on trajectories generated from conventional all-atom MD simulations. We demonstrate the framework's effectiveness using the 168-amino protein target T1027 from CASP14. Statistical distributions of the collective variables and time series of root mean square deviation (RMSD) obtained from our coarse-grained simulations closely resemble those from all-atom MD simulations. This method is not only useful for studying the movements of complex proteins, but also has the potential to be adapted for simulating other biomolecules like DNA, RNA, and even electrolytes in batteries. △ Less

Submitted 12 July, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

Comments: 13 pages, 9 figures

arXiv:2403.14046 [pdf]

Desiderata of evidence for representation in neuroscience

Authors: Stephan Pohl, Edgar Y. Walker, David L. Barack, Jennifer Lee, Rachel N. Denison, Ned Block, Florent Meyniel, Wei Ji Ma

Abstract: This paper develops a systematic framework for the evidence neuroscientists use to establish whether a neural response represents a feature. Researchers try to establish that the neural response is (1) sensitive and (2) specific to the feature, (3) invariant to other features, and (4) functional, which means that it is used downstream in the brain. We formalize these desiderata in information-theo… ▽ More This paper develops a systematic framework for the evidence neuroscientists use to establish whether a neural response represents a feature. Researchers try to establish that the neural response is (1) sensitive and (2) specific to the feature, (3) invariant to other features, and (4) functional, which means that it is used downstream in the brain. We formalize these desiderata in information-theoretic terms. This formalism allows us to precisely state the desiderata while unifying the different analysis methods used in neuroscience under one framework. We discuss how common methods such as correlational analyses, decoding and encoding models, representational similarity analysis, and tests of statistical dependence are used to evaluate the desiderata. In doing so, we provide a common terminology to researchers that helps to clarify disagreements, to compare and integrate results across studies and research groups, and to identify when evidence might be missing and when evidence for some representational conclusion is strong. We illustrate the framework with several canonical examples, including the representation of orientation, numerosity, faces, and spatial location. We end by discussing how the framework can be extended to cover models of the neural code, multi-stage models, and other domains. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: 50 pages, 11 figures

arXiv:2403.07902 [pdf, other]

DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design

Authors: Jiaqi Guan, Xiangxin Zhou, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu

Abstract: Designing 3D ligands within a target binding site is a fundamental task in drug discovery. Existing structured-based drug design methods treat all ligand atoms equally, which ignores different roles of atoms in the ligand for drug design and can be less efficient for exploring the large drug-like molecule space. In this paper, inspired by the convention in pharmaceutical practice, we decompose the… ▽ More Designing 3D ligands within a target binding site is a fundamental task in drug discovery. Existing structured-based drug design methods treat all ligand atoms equally, which ignores different roles of atoms in the ligand for drug design and can be less efficient for exploring the large drug-like molecule space. In this paper, inspired by the convention in pharmaceutical practice, we decompose the ligand molecule into two parts, namely arms and scaffold, and propose a new diffusion model, DecompDiff, with decomposed priors over arms and scaffold. In order to facilitate the decomposed generation and improve the properties of the generated molecules, we incorporate both bond diffusion in the model and additional validity guidance in the sampling phase. Extensive experiments on CrossDocked2020 show that our approach achieves state-of-the-art performance in generating high-affinity molecules while maintaining proper molecular properties and conformational stability, with up to -8.39 Avg. Vina Dock score and 24.5 Success Rate. The code is provided at https://github.com/bytedance/DecompDiff △ Less

Submitted 26 February, 2024; originally announced March 2024.

Comments: Accepted to ICML 2023

arXiv:2401.13022 [pdf]

Harmonizing the Generation and Pre-publication Stewardship of FAIR Image Data

Authors: Nikki Bialy, Frank Alber, Brenda Andrews, Michael Angelo, Brian Beliveau, Lacramioara Bintu, Alistair Boettiger, Ulrike Boehm, Claire M. Brown, Mahmoud Bukar Maina, James J. Chambers, Beth A. Cimini, Kevin Eliceiri, Rachel Errington, Orestis Faklaris, Nathalie Gaudreault, Ronald N. Germain, Wojtek Goscinski, David Grunwald, Michael Halter, Dorit Hanein, John W. Hickey, Judith Lacoste, Alex Laude, Emma Lundberg , et al. (22 additional authors not shown)

Abstract: Together with the molecular knowledge of genes and proteins, biological images promise to significantly enhance the scientific understanding of complex cellular systems and to advance predictive and personalized therapeutic products for human health. For this potential to be realized, quality-assured image data must be shared among labs at a global scale to be compared, pooled, and reanalyzed, thu… ▽ More Together with the molecular knowledge of genes and proteins, biological images promise to significantly enhance the scientific understanding of complex cellular systems and to advance predictive and personalized therapeutic products for human health. For this potential to be realized, quality-assured image data must be shared among labs at a global scale to be compared, pooled, and reanalyzed, thus unleashing untold potential beyond the original purpose for which the data was generated. There are two broad sets of requirements to enable image data sharing in the life sciences. One set of requirements is articulated in the companion White Paper entitled Enabling Global Image Data Sharing in the Life Sciences, which is published in parallel and addresses the need to build the cyberinfrastructure for sharing the digital array data. In this White Paper, we detail a broad set of requirements, which involves collecting, managing, presenting, and propagating contextual information essential to assess the quality, understand the content, interpret the scientific implications, and reuse image data in the context of the experimental details. We start by providing an overview of the main lessons learned to date through international community activities, which have recently made considerable progress toward generating community standard practices for imaging Quality Control (QC) and metadata. We then provide a clear set of recommendations for amplifying this work. The driving goal is to address remaining challenges and democratize access to everyday practices and tools for a spectrum of biomedical researchers, regardless of their expertise, access to resources, and geographical location. △ Less

Submitted 8 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

Comments: This manuscript is published with a closely related companion entitled, Enabling Global Image Data Sharing in the Life Sciences, which can be found at the following link, arXiv:2401.13023 [q-bio.OT]

arXiv:2401.08851 [pdf]

Using i-vectors for subject-independent cross-session EEG transfer learning

Authors: Jonathan Lasko, Jeff Ma, Mike Nicoletti, Jonathan Sussman-Fort, Sooyoung Jeong, William Hartmann

Abstract: Cognitive load classification is the task of automatically determining an individual's utilization of working memory resources during performance of a task based on physiologic measures such as electroencephalography (EEG). In this paper, we follow a cross-disciplinary approach, where tools and methodologies from speech processing are used to tackle this problem. The corpus we use was released pub… ▽ More Cognitive load classification is the task of automatically determining an individual's utilization of working memory resources during performance of a task based on physiologic measures such as electroencephalography (EEG). In this paper, we follow a cross-disciplinary approach, where tools and methodologies from speech processing are used to tackle this problem. The corpus we use was released publicly in 2021 as part of the first passive brain-computer interface competition on cross-session workload estimation. We present our approach which used i-vector-based neural network classifiers to accomplish inter-subject cross-session EEG transfer learning, achieving 18% relative improvement over equivalent subject-dependent models. We also report experiments showing how our subject-independent models perform competitively on held-out subjects and improve with additional subject data, suggesting that subject-dependent training is not required for effective cognitive load determination. △ Less

Submitted 16 January, 2024; originally announced January 2024.

Comments: 11 pages

arXiv:2312.00485 [pdf, other]

Backbone-based Dynamic Graph Spatio-Temporal Network for Epidemic Forecasting

Authors: Junkai Mao, Yuexing Han, Gouhei Tanaka, Bing Wang

Abstract: Accurate epidemic forecasting is a critical task in controlling disease transmission. Many deep learning-based models focus only on static or dynamic graphs when constructing spatial information, ignoring their relationship. Additionally, these models often rely on recurrent structures, which can lead to error accumulation and computational time consumption. To address the aforementioned problems,… ▽ More Accurate epidemic forecasting is a critical task in controlling disease transmission. Many deep learning-based models focus only on static or dynamic graphs when constructing spatial information, ignoring their relationship. Additionally, these models often rely on recurrent structures, which can lead to error accumulation and computational time consumption. To address the aforementioned problems, we propose a novel model called Backbone-based Dynamic Graph Spatio-Temporal Network (BDGSTN). Intuitively, the continuous and smooth changes in graph structure, make adjacent graph structures share a basic pattern. To capture this property, we use adaptive methods to generate static backbone graphs containing the primary information and temporal models to generate dynamic temporal graphs of epidemic data, fusing them to generate a backbone-based dynamic graph. To overcome potential limitations associated with recurrent structures, we introduce a linear model DLinear to handle temporal dependencies and combine it with dynamic graph convolution for epidemic forecasting. Extensive experiments on two datasets demonstrate that BDGSTN outperforms baseline models and ablation comparison further verifies the effectiveness of model components. Furthermore, we analyze and measure the significance of backbone and temporal graphs by using information metrics from different aspects. Finally, we compare model parameter volume and training time to confirm the superior complexity and efficiency of BDGSTN. △ Less

Submitted 1 December, 2023; originally announced December 2023.

arXiv:2311.15156 [pdf, other]

xTrimoGene: An Efficient and Scalable Representation Learner for Single-Cell RNA-Seq Data

Authors: Jing Gong, Minsheng Hao, Xingyi Cheng, Xin Zeng, Chiming Liu, Jianzhu Ma, Xuegong Zhang, Taifeng Wang, Le Song

Abstract: Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classic… ▽ More Advances in high-throughput sequencing technology have led to significant progress in measuring gene expressions at the single-cell level. The amount of publicly available single-cell RNA-seq (scRNA-seq) data is already surpassing 50M records for humans with each record measuring 20,000 genes. This highlights the need for unsupervised representation learning to fully ingest these data, yet classical transformer architectures are prohibitive to train on such data in terms of both computation and memory. To address this challenge, we propose a novel asymmetric encoder-decoder transformer for scRNA-seq data, called xTrimoGene$^α$ (or xTrimoGene for short), which leverages the sparse characteristic of the data to scale up the pre-training. This scalable design of xTrimoGene reduces FLOPs by one to two orders of magnitude compared to classical transformers while maintaining high accuracy, enabling us to train the largest transformer models over the largest scRNA-seq dataset today. Our experiments also show that the performance of xTrimoGene improves as we scale up the model sizes, and it also leads to SOTA performance over various downstream tasks, such as cell type annotation, perturb-seq effect prediction, and drug combination prediction. xTrimoGene model is now available for use as a service via the following link: https://api.biomap.com/xTrimoGene/apply. △ Less

Submitted 24 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

Comments: Accepted by NeurIPS 2023

arXiv:2309.08478 [pdf, other]

Current and future directions in network biology

Authors: Marinka Zitnik, Michelle M. Li, Aydin Wells, Kimberly Glass, Deisy Morselli Gysi, Arjun Krishnan, T. M. Murali, Predrag Radivojac, Sushmita Roy, Anaïs Baudot, Serdar Bozdag, Danny Z. Chen, Lenore Cowen, Kapil Devkota, Anthony Gitter, Sara Gosline, Pengfei Gu, Pietro H. Guzzi, Heng Huang, Meng Jiang, Ziynet Nesibe Kesimoglu, Mehmet Koyuturk, Jian Ma, Alexander R. Pico, Nataša Pržulj , et al. (12 additional authors not shown)

Abstract: Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These challenges stem from various fa… ▽ More Network biology is an interdisciplinary field bridging computational and biological sciences that has proved pivotal in advancing the understanding of cellular functions and diseases across biological systems and scales. Although the field has been around for two decades, it remains nascent. It has witnessed rapid evolution, accompanied by emerging challenges. These challenges stem from various factors, notably the growing complexity and volume of data together with the increased diversity of data types describing different tiers of biological organization. We discuss prevailing research directions in network biology and highlight areas of inference and comparison of biological networks, multimodal data integration and heterogeneous networks, higher-order network analysis, machine learning on networks, and network-based personalized medicine. Following the overview of recent breakthroughs across these five areas, we offer a perspective on the future directions of network biology. Additionally, we offer insights into scientific communities, educational initiatives, and the importance of fostering diversity within the field. This paper establishes a roadmap for an immediate and long-term vision for network biology. △ Less

Submitted 11 June, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

Comments: 52 pages, 6 figures, 1 table

arXiv:2308.05864 [pdf, other]

doi 10.1038/s41592-024-02233-6

The Multi-modality Cell Segmentation Challenge: Towards Universal Solutions

Authors: Jun Ma, Ronald Xie, Shamini Ayyadhury, Cheng Ge, Anubha Gupta, Ritu Gupta, Song Gu, Yao Zhang, Gihun Lee, Joonkee Kim, Wei Lou, Haofeng Li, Eric Upschulte, Timo Dickscheid, José Guilherme de Almeida, Yixin Wang, Lin Han, Xin Yang, Marco Labagnara, Vojislav Gligorovski, Maxime Scheder, Sahand Jamal Rahi, Carly Kempster, Alice Pollitt, Leon Espinosa , et al. (15 additional authors not shown)

Abstract: Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diver… ▽ More Cell segmentation is a critical step for quantitative single-cell analysis in microscopy images. Existing cell segmentation methods are often tailored to specific modalities or require manual interventions to specify hyper-parameters in different experimental settings. Here, we present a multi-modality cell segmentation benchmark, comprising over 1500 labeled images derived from more than 50 diverse biological experiments. The top participants developed a Transformer-based deep-learning algorithm that not only exceeds existing methods but can also be applied to diverse microscopy images across imaging platforms and tissue types without manual parameter adjustments. This benchmark and the improved algorithm offer promising avenues for more accurate and versatile cell analysis in microscopy imaging. △ Less

Submitted 1 April, 2024; v1 submitted 10 August, 2023; originally announced August 2023.

Comments: NeurIPS22 Cell Segmentation Challenge: https://neurips22-cellseg.grand-challenge.org/ . Nature Methods (2024)

arXiv:2305.12471 [pdf, other]

Mapping Biological Neuron Dynamics into an Interpretable Two-layer Artificial Neural Network

Authors: Jingyang Ma, Songting Li, Douglas Zhou

Abstract: Dendrites are crucial structures for computation of an individual neuron. It has been shown that the dynamics of a biological neuron with dendrites can be approximated by artificial neural networks (ANN) with deep structure. However, it remains unclear whether a neuron can be further captured by a simple, biologically plausible ANN. In this work, we develop a two-layer ANN, named as dendritic bili… ▽ More Dendrites are crucial structures for computation of an individual neuron. It has been shown that the dynamics of a biological neuron with dendrites can be approximated by artificial neural networks (ANN) with deep structure. However, it remains unclear whether a neuron can be further captured by a simple, biologically plausible ANN. In this work, we develop a two-layer ANN, named as dendritic bilinear neural network (DBNN), to accurately predict both the sub-threshold voltage and spike time at the soma of biological neuron models with dendritic structure. Our DBNN is found to be interpretable and well captures the dendritic integration process of biological neurons including a bilinear rule revealed in previous works. In addition, we show DBNN is capable of performing diverse tasks including direction selectivity, coincidence detection, and image classification. Our work proposes a biologically interpretable ANN that characterizes the computation of biological neurons, which can be potentially implemented in the deep learning framework to improve computational ability. △ Less

Submitted 21 May, 2023; originally announced May 2023.

arXiv:2305.07508 [pdf, other]

MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation

Authors: Xingang Peng, Jiaqi Guan, Qiang Liu, Jianzhu Ma

Abstract: Deep generative models have recently achieved superior performance in 3D molecule generation. Most of them first generate atoms and then add chemical bonds based on the generated atoms in a post-processing manner. However, there might be no corresponding bond solution for the temporally generated atoms as their locations are generated without considering potential bonds. We define this problem as… ▽ More Deep generative models have recently achieved superior performance in 3D molecule generation. Most of them first generate atoms and then add chemical bonds based on the generated atoms in a post-processing manner. However, there might be no corresponding bond solution for the temporally generated atoms as their locations are generated without considering potential bonds. We define this problem as the atom-bond inconsistency problem and claim it is the main reason for current approaches to generating unrealistic 3D molecules. To overcome this problem, we propose a new diffusion model called MolDiff which can generate atoms and bonds simultaneously while still maintaining their consistency by explicitly modeling the dependence between their relationships. We evaluated the generation ability of our proposed model and the quality of the generated molecules using criteria related to both geometry and chemical properties. The empirical studies showed that our model outperforms previous approaches, achieving a three-fold improvement in success rate and generating molecules with significantly better quality. △ Less

Submitted 11 May, 2023; originally announced May 2023.

arXiv:2305.06769 [pdf]

Comparative Analysis of Machine Learning Algorithms for Predicting On-Target and Off-Target Effects of CRISPR-Cas13d for gene editing

Authors: Jingze Liu, Jiahao Ma

Abstract: CRISPR-Cas13 is a system that utilizes single stranded RNAs for RNA editing. Prediction of on-target and off-target effects for the CRISPR-Cas13d dependency enables us to design specific single guide RNAs (sgRNAs) that help locate the desired RNA target positions. In this study, we compared the performance of multiple machine learning algorithms in predicting these effects using a reported dataset… ▽ More CRISPR-Cas13 is a system that utilizes single stranded RNAs for RNA editing. Prediction of on-target and off-target effects for the CRISPR-Cas13d dependency enables us to design specific single guide RNAs (sgRNAs) that help locate the desired RNA target positions. In this study, we compared the performance of multiple machine learning algorithms in predicting these effects using a reported dataset. Our results show that Catboost is the most accurate model with high sensitivity. This finding represents a significant advancement in our understanding of how to chose modeling methods to deal with RNA sequence feaatures effictivelys. Furthermore, our approach can potentially be applied to other CRISPR systems and genetic engineering techniques. Overall, this work has important implications for developing safer and more effective gene therapies and biotechnological applications. △ Less

Submitted 11 May, 2023; originally announced May 2023.

Comments: code: https://www.kaggle.com/code/markblack370/cas13-pycaret/notebook

MSC Class: 68T05 ACM Class: I.2.6

arXiv:2304.13230 [pdf, other]

UNADON: Transformer-based model to predict genome-wide chromosome spatial position

Authors: Muyu Yang, Jian Ma

Abstract: The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome… ▽ More The spatial positioning of chromosomes relative to functional nuclear bodies is intertwined with genome functions such as transcription. However, the sequence patterns and epigenomic features that collectively influence chromatin spatial positioning in a genome-wide manner are not well understood. Here, we develop a new transformer-based deep learning model called UNADON, which predicts the genome-wide cytological distance to a specific type of nuclear body, as measured by TSA-seq, using both sequence features and epigenomic signals. Evaluations of UNADON in four cell lines (K562, H1, HFFc6, HCT116) show high accuracy in predicting chromatin spatial positioning to nuclear bodies when trained on a single cell line. UNADON also performed well in an unseen cell type. Importantly, we reveal potential sequence and epigenomic factors that affect large-scale chromatin compartmentalization to nuclear bodies. Together, UNADON provides new insights into the principles between sequence features and large-scale chromatin spatial localization, which has important implications for understanding nuclear structure and function. △ Less

Submitted 1 July, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

Comments: Published in ISMB 2023

arXiv:2303.03543 [pdf, other]

3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction

Authors: Jiaqi Guan, Wesley Wei Qian, Xingang Peng, Yufeng Su, Jian Peng, Jianzhu Ma

Abstract: Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or th… ▽ More Rich data and powerful machine learning models allow us to design drugs for a specific protein target \textit{in silico}. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or the autoregressive sampling process, which are not equivariant to rotation or easily violate geometric constraints resulting in unrealistic structures. In this work, we develop a 3D equivariant diffusion model to solve the above challenges. To achieve target-aware molecule design, our method learns a joint generative process of both continuous atom coordinates and categorical atom types with a SE(3)-equivariant network. Moreover, we show that our model can serve as an unsupervised feature extractor to estimate the binding affinity under proper parameterization, which provides an effective way for drug screening. To evaluate our model, we propose a comprehensive framework to evaluate the quality of sampled molecules from different dimensions. Empirical studies show our model could generate molecules with more realistic 3D structures and better affinities towards the protein targets, and improve binding affinity ranking and prediction without retraining. △ Less

Submitted 6 March, 2023; originally announced March 2023.

Comments: Accepted to ICLR 2023

arXiv:2302.00652 [pdf, other]

Breathing cluster in complex neuron-astrocyte networks

Authors: Ya Wang, Liang Wang, Huawei Fan, Jun Ma, Hui Cao, Xingang Wang

Abstract: Brain activities are featured by spatially distributed neural clusters of coherent firings and a spontaneous switching of the clusters between the synchrony and asynchrony states. Evidences from {\it in vivo} experiments suggest that astrocytes, a type of glial cell regarded previously as providing only structural and metabolic supports to neurons, participate actively in brain functions and play… ▽ More Brain activities are featured by spatially distributed neural clusters of coherent firings and a spontaneous switching of the clusters between the synchrony and asynchrony states. Evidences from {\it in vivo} experiments suggest that astrocytes, a type of glial cell regarded previously as providing only structural and metabolic supports to neurons, participate actively in brain functions and play a crucial role in regulating the neural firing activities, yet the mechanism remains unknown. Introducing astrocyte as a reservoir of the glutamate released from neuron synapses, here we propose the model of complex neuron-astrocyte network and employ it to explore the roles of astrocyte in regulating the synchronization behaviors of networked neurons. It is found that a fraction of neurons on the network can be synchronized as a cluster, while the remaining neurons are kept as desynchronized. Moreover, during the course of network evolution, the cluster is switching between the synchrony and asynchrony states intermittently, henceforth the phenomenon of ``breathing cluster". By the method of symmetry-based analysis, we conduct a theoretical investigation on the stability of the cluster and the mechanism generating the breathing activities. It is revealed that the contents of the cluster are determined by the network symmetry and the breathing activities are due to the interplay between the neural network and the astrocyte. The breathing phenomenon is demonstrated in network models of different structures and neural dynamics. The studies give insights into the cellular mechanism of astrocytes in regulating neural activities, and shed lights onto the spontaneous state switching of the neocortex. △ Less

Submitted 26 January, 2023; originally announced February 2023.

Comments: 14 pages, 6 figures

arXiv:2211.08084 [pdf]

Inferring cell-specific lncRNA regulation with single-cell RNA-sequencing data in the developing human neocortex

Authors: Meng Huang, Jiangtao Ma, Changzhou Long, Junpeng Zhang, Xiucai Ye, Tetsuya Sakurai

Abstract: Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-s… ▽ More Long non-coding RNAs (lncRNAs) are important regulators to modulate gene expression and cell proliferation in the developing human brain. Previous methods mainly use bulk lncRNA and mRNA expression data to study lncRNA regulation. However, to analyze lncRNA regulation regarding individual cells, we focus on single-cell RNA-sequencing (scRNA-seq) data instead of bulk data. Recent advance in scRNA-seq has provided a way to investigate lncRNA regulation at single-cell level. We will propose a novel computational method, CSlncR (cell-specific lncRNA regulation), which combines putative lncRNA-mRNA binding information with scRNA-seq data including lncRNAs and mRNAs to identify cell-specific lncRNA-mRNA regulation networks at individual cells. To understand lncRNA regulation at different development stages, we apply CSlncR to the scRNA-seq data of human neocortex. Network analysis shows that the lncRNA regulation is unique in each cell from the different human neocortex development stages. The comparison results indicate that CSlncR is also an effective tool for predicting cell-specific lncRNA targets and clustering single cells, which helps understand cell-cell communication. △ Less

Submitted 29 November, 2022; v1 submitted 15 November, 2022; originally announced November 2022.

arXiv:2208.14668 [pdf]

A Resonance Model for Spontaneous Cortical Activity

Authors: Yanjiang Wang, Jichao Ma, Jiebin Luo, Xue Chen, Yue Yuan

Abstract: How human brain function emerges from structure has intrigued researchers for decades and numerous models have been put forward, yet none of them yields a close structure-function relation. Here we present a resonance model based on neuronal spike timing dependent plasticity (STDP) principle to describe the spontaneous cortical activity by incorporating the dynamic interactions between neuronal po… ▽ More How human brain function emerges from structure has intrigued researchers for decades and numerous models have been put forward, yet none of them yields a close structure-function relation. Here we present a resonance model based on neuronal spike timing dependent plasticity (STDP) principle to describe the spontaneous cortical activity by incorporating the dynamic interactions between neuronal populations into a wave equation, which is able to accurately predict the resting brain functional connectivity (FC), including the resting-state networks. Besides, the proposed model provides strong theoretical and experimental evidences that the spontaneous dynamic coupling between brain regions fluctuates with a low frequency. Crucially, it is able to account for how the negative functional correlations emerge during resonance. We test the model with a large cohort of subjects (1038) from the Human Connectome Project (HCP) S1200 release in both time and frequency domain, which exhibits superior performance to existing eigen-decomposition models. △ Less

Submitted 6 October, 2022; v1 submitted 31 August, 2022; originally announced August 2022.

arXiv:2208.02433 [pdf, other]

Simulation and application of COVID-19 compartment model using physics-informed neural network

Authors: Jinhuan Ke, Jiahao Ma, Xiyu Yin, Robin Singh

Abstract: COVID-19 pandemic has had a disruptive and irreversible impact globally, yet traditional epidemiological modeling approaches such as the susceptible-infected-recovered (SIR) model have exhibited limited effectiveness in forecasting of the up-to-date pandemic situation. In this work, susceptible-vaccinated-exposed-infected-dead-recovered (SVEIDR) model and its variants -- aged and vaccination-struc… ▽ More COVID-19 pandemic has had a disruptive and irreversible impact globally, yet traditional epidemiological modeling approaches such as the susceptible-infected-recovered (SIR) model have exhibited limited effectiveness in forecasting of the up-to-date pandemic situation. In this work, susceptible-vaccinated-exposed-infected-dead-recovered (SVEIDR) model and its variants -- aged and vaccination-structured SVEIDR models -- are introduced to encode the effect of social contact for different age groups and vaccination status. Then, we implement the physics-informed neural network (PiNN) on both simulated and real-world data. The PiNN model enables robust analysis of the dynamic spread, prediction, and parameter optimization of the COVID-19 compartmental models. The models exhibit relative root mean square error (RRMSE) of $<4\%$ for all components and provide incubation, death, and recovery rates of $γ= 0.0130$, $λ=0.0001$, and $ρ=0.0037$, respectively, for the first 310 days of the epidemic in the US with RRMSE of $<0.35\%$ for all components. To further improve the model performance, temporally varying parameters can be included, such as vaccination, transmission, and incubation rates. Our implementation highlights PiNN as a reliable candidate approach for forecasting real-world data and can be applied to other compartmental model variants of interest. △ Less

Submitted 12 October, 2022; v1 submitted 3 August, 2022; originally announced August 2022.

arXiv:2207.03523 [pdf, ps, other]

Winning the lottery with neural connectivity constraints: faster learning across cognitive tasks with spatially constrained sparse RNNs

Authors: Mikail Khona, Sarthak Chandra, Joy J. Ma, Ila Fiete

Abstract: Recurrent neural networks (RNNs) are often used to model circuits in the brain, and can solve a variety of difficult computational problems requiring memory, error-correction, or selection [Hopfield, 1982, Maass et al., 2002, Maass, 2011]. However, fully-connected RNNs contrast structurally with their biological counterparts, which are extremely sparse (~0.1%). Motivated by the neocortex, where ne… ▽ More Recurrent neural networks (RNNs) are often used to model circuits in the brain, and can solve a variety of difficult computational problems requiring memory, error-correction, or selection [Hopfield, 1982, Maass et al., 2002, Maass, 2011]. However, fully-connected RNNs contrast structurally with their biological counterparts, which are extremely sparse (~0.1%). Motivated by the neocortex, where neural connectivity is constrained by physical distance along cortical sheets and other synaptic wiring costs, we introduce locality masked RNNs (LM-RNNs) that utilize task-agnostic predetermined graphs with sparsity as low as 4%. We study LM-RNNs in a multitask learning setting relevant to cognitive systems neuroscience with a commonly used set of tasks, 20-Cog-tasks [Yang et al., 2019]. We show through reductio ad absurdum that 20-Cog-tasks can be solved by a small pool of separated autapses that we can mechanistically analyze and understand. Thus, these tasks fall short of the goal of inducing complex recurrent dynamics and modular structure in RNNs. We next contribute a new cognitive multi-task battery, Mod-Cog, consisting of upto 132 tasks that expands by 7-fold the number of tasks and task-complexity of 20-Cog-tasks. Importantly, while autapses can solve the simple 20-Cog-tasks, the expanded task-set requires richer neural architectures and continuous attractor dynamics. On these tasks, we show that LM-RNNs with an optimal sparsity result in faster training and better data-efficiency than fully connected networks. △ Less

Submitted 29 May, 2023; v1 submitted 7 July, 2022; originally announced July 2022.

Comments: 12 pages, 5 main text figures

arXiv:2206.05059 [pdf]

Simulation, Modeling and Prediction of a Pharmacodynamic Animal Tissue Culture Compartment Model by Physical Informed Neural Network

Authors: Jiahao Ma

Abstract: Compartment models of cell culture are widely used in cytology, pharmacology, toxicology and other fields. Numerical simulation, data modeling and prediction of compartment models can be realized by traditional differential equation modeling methods. At the same time, with the development of software and hardware, Physical Informed Neural Network (PINN) is widely used to solve differential equatio… ▽ More Compartment models of cell culture are widely used in cytology, pharmacology, toxicology and other fields. Numerical simulation, data modeling and prediction of compartment models can be realized by traditional differential equation modeling methods. At the same time, with the development of software and hardware, Physical Informed Neural Network (PINN) is widely used to solve differential equation models. This work models, simulates and predicts the cell culture compartment model based on the machine learning framework PyTorch with an 16 hidden layers neural network, including 8 linear layers and 8 feedback active layers. The results showed a loss value of 0.0004853 for three-component four-parameter quantitative pharmacodynamic model predictions in this way, which is evaluated by Mean Square Error (MSE). In summary, Physical Informed Neural Network can serve as an effective tool to deal with cell culture compartment models and may perform better in dealing with big datasets. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: 7 pages, 5 figures

arXiv:2205.14195 [pdf, other]

Unsupervised learning of features and object boundaries from local prediction

Authors: Heiko H. Schütt, Wei Ji Ma

Abstract: A visual system has to learn both which features to extract from images and how to group locations into (proto-)objects. Those two aspects are usually dealt with separately, although predictability is discussed as a cue for both. To incorporate features and boundaries into the same model, we model a layer of feature maps with a pairwise Markov random field model in which each factor is paired with… ▽ More A visual system has to learn both which features to extract from images and how to group locations into (proto-)objects. Those two aspects are usually dealt with separately, although predictability is discussed as a cue for both. To incorporate features and boundaries into the same model, we model a layer of feature maps with a pairwise Markov random field model in which each factor is paired with an additional binary variable, which switches the factor on or off. Using one of two contrastive learning objectives, we can learn both the features and the parameters of the Markov random field factors from images without further supervision signals. The features learned by shallow neural networks based on this loss are local averages, opponent colors, and Gabor-like stripe patterns. Furthermore, we can infer connectivity between locations by inferring the switch variables. Contours inferred from this connectivity perform quite well on the Berkeley segmentation database (BSDS500) without any training on contours. Thus, computing predictions across space aids both segmentation and feature learning, and models trained to optimize these predictions show similarities to the human visual system. We speculate that retinotopic visual cortex might implement such predictions over space through lateral connections. △ Less

Submitted 27 May, 2022; originally announced May 2022.

Comments: Submitted to NeurIPS 2022

arXiv:2205.07309 [pdf, other]

3DLinker: An E(3) Equivariant Variational Autoencoder for Molecular Linker Design

Authors: Yinan Huang, Xingang Peng, Jianzhu Ma, Muhan Zhang

Abstract: Deep learning has achieved tremendous success in designing novel chemical compounds with desirable pharmaceutical properties. In this work, we focus on a new type of drug design problem -- generating a small "linker" to physically attach two independent molecules with their distinct functions. The main computational challenges include: 1) the generation of linkers is conditional on the two given m… ▽ More Deep learning has achieved tremendous success in designing novel chemical compounds with desirable pharmaceutical properties. In this work, we focus on a new type of drug design problem -- generating a small "linker" to physically attach two independent molecules with their distinct functions. The main computational challenges include: 1) the generation of linkers is conditional on the two given molecules, in contrast to generating full molecules from scratch in previous works; 2) linkers heavily depend on the anchor atoms of the two molecules to be connected, which are not known beforehand; 3) 3D structures and orientations of the molecules need to be considered to avoid atom clashes, for which equivariance to E(3) group are necessary. To address these problems, we propose a conditional generative model, named 3DLinker, which is able to predict anchor atoms and jointly generate linker graphs and their 3D structures based on an E(3) equivariant graph variational autoencoder. So far as we know, there are no previous models that could achieve this task. We compare our model with multiple conditional generative models modified from other molecular design tasks and find that our model has a significantly higher rate in recovering molecular graphs, and more importantly, accurately predicting the 3D coordinates of all the atoms. △ Less

Submitted 15 May, 2022; originally announced May 2022.

arXiv:2205.07249 [pdf, other]

Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets

Authors: Xingang Peng, Shitong Luo, Jiaqi Guan, Qi Xie, Jian Peng, Jianzhu Ma

Abstract: Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of works have shown the great potential in advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting posts fundamental computational challenges in sampling new chemical compounds that could satisfy multiple g… ▽ More Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of works have shown the great potential in advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting posts fundamental computational challenges in sampling new chemical compounds that could satisfy multiple geometrical constraints imposed by pockets. Previous sampling algorithms either sample in the graph space or only consider the 3D coordinates of atoms while ignoring other detailed chemical structures such as bond types and functional groups. To address the challenge, we develop Pocket2Mol, an E(3)-equivariant generative network composed of two modules: 1) a new graph neural network capturing both spatial and bonding relationships between atoms of the binding pockets and 2) a new efficient algorithm which samples new drug candidates conditioned on the pocket representations from a tractable distribution without relying on MCMC. Experimental results demonstrate that molecules sampled from Pocket2Mol achieve significantly better binding affinity and other drug properties such as druglikeness and synthetic accessibility. △ Less

Submitted 15 May, 2022; originally announced May 2022.

Comments: ICML 2022 accepted

arXiv:2205.03583 [pdf]

Scanning Electron Microscopy and Metabolite Measurement Revealed the Stress Mechanism of PS-COOH Microplastics on Rhodotorula mucilaginosa AN5

Authors: Jiahao Ma, Xiangfei Meng, Zixin Li, Lexian Li, Jiwen Xu, Guangfeng Kan

Abstract: Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa… ▽ More Microplastics in the marine environment have been paid more and more attention by researchers, and the impact of these substances on marine microorganisms can not be ignored. Studies have shown that PS-COOH Microplastics are harmful to marine molluscs, algae and monads. This study explore the effect and mechanism of microplastics (80 nm PS-COOH) on Antarctic marine yeast, Rhodotorula mucilaginosa AN5 by bacterial count, Scanning Electron Microscopy (SEM) and metabolite analysis. The results illustrates that a 50 mg/L concentration of PS-COOH could inhibit 36.15% growth of yeast cells and 10 mg/L inhibit 80.20%. Microplastics stress causes changes in the content of some oxidative stress substances, including reactive oxygen species (ROS) 42.86% , malondialdehyde (MDA) 54.06% content and the activities of antioxidant enzymes such as catalase (CAT) 36.00% , peroxidase (POD) 66.67% and superoxide dismutase (SOD) 25.40%. These results revealed the possible stress effect of microplastic pollution on marine yeast and may affect bottom layer of marine ecosystem. △ Less

Submitted 13 September, 2022; v1 submitted 7 May, 2022; originally announced May 2022.

arXiv:2204.11026 [pdf]

Bioinformatic analysis for structure and function of Glutamine synthetase(GS)

Authors: Jiahao Ma, Guotong Xu, Le Ao, Siqi Chen, Jingze Liu

Abstract: Objective: To predict structure and function of Glutamine synthetase (GS) from Pseudoalteromonas sp. by bioinformatics technology, and to provide a theoretical basis for further study. Methods: Open reading frame (ORF) of GS sequence from Pseudoalteromonas sp. was obtained by ORF finder and was translated into amino acid residue. The structure domain was analyzed by Blast. By the method of analysi… ▽ More Objective: To predict structure and function of Glutamine synthetase (GS) from Pseudoalteromonas sp. by bioinformatics technology, and to provide a theoretical basis for further study. Methods: Open reading frame (ORF) of GS sequence from Pseudoalteromonas sp. was obtained by ORF finder and was translated into amino acid residue. The structure domain was analyzed by Blast. By the method of analysis tools: Protparam, ProtScale, SignalP-4.0, TMHMM, SOPMA, SWISS-MODEL, NCBI SMART-BLAST and MAGA 7.0, the structure and function of the protein were predicted and analyzed. Results: The results showed that the sequence was GS with 468 amino acid residues, theoretical molecular weight was 51986.64 Da. The protein has the closest evolutionary status with Shewanella oneidensis. Then it had no signal peptide site and transmembrane domain. Secondary structure of GS contained 35.04% alpha-helix, 16.67% Extended chain, 5.34% beta-turn, 42.95% RandomCoil. Conclusions: This GU was a variety of biological functions of protein that may be used as a molecular samples of microbial nitrogen metabolism in extreme environments. △ Less

Submitted 23 April, 2022; originally announced April 2022.

Comments: 8 pages, 8 figures

arXiv:2203.10446 [pdf, other]

A 3D Generative Model for Structure-Based Drug Design

Authors: Shitong Luo, Jiaqi Guan, Jianzhu Ma, Jian Peng

Abstract: We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites. While we have witnessed the great success of deep generative models in drug design, the existing methods are mostly string-based or graph-based. They are limited by the lack of spatial information and thus unable to be applied to structure-based design tasks. Particula… ▽ More We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites. While we have witnessed the great success of deep generative models in drug design, the existing methods are mostly string-based or graph-based. They are limited by the lack of spatial information and thus unable to be applied to structure-based design tasks. Particularly, such models have no or little knowledge of how molecules interact with their target proteins exactly in 3D space. In this paper, we propose a 3D generative model that generates molecules given a designated 3D protein binding site. Specifically, given a binding site as the 3D context, our model estimates the probability density of atom's occurrences in 3D space -- positions that are more likely to have atoms will be assigned higher probability. To generate 3D molecules, we propose an auto-regressive sampling scheme -- atoms are sampled sequentially from the learned distribution until there is no room for new atoms. Combined with this sampling scheme, our model can generate valid and diverse molecules, which could be applicable to various structure-based molecular design tasks such as molecule sampling and linker design. Experimental results demonstrate that molecules sampled from our model exhibit high binding affinity to specific targets and good drug properties such as drug-likeness even if the model is not explicitly optimized for them. △ Less

Submitted 12 November, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

Comments: Accepted to NeurIPS 2021

arXiv:2202.07921 [pdf, other]

Comparison on gait characteristics between controlled and free-living conditions in old adults

Authors: Jian Ma

Abstract: Gait is an important biomarker of functional conditions and gait characteristics can help us assessing health conditions and managing progression of diseases. Most of the existing research study the gait in controlled condition, such as clinical tests. In this paper, we study the gait characteristics in free-living conditions in old adults and compare them with that in controlled conditions, i.e.,… ▽ More Gait is an important biomarker of functional conditions and gait characteristics can help us assessing health conditions and managing progression of diseases. Most of the existing research study the gait in controlled condition, such as clinical tests. In this paper, we study the gait characteristics in free-living conditions in old adults and compare them with that in controlled conditions, i.e., Timed Up and Go (TUG) test. 65 subjects (12 patients with mobility impairment and 53 healthy controls) are recruited from elderly nursing institutions. The video data are collected from them in TUG test and free-living conditions and the 9 gait characteristics, including gait speed, are extracted from the data. Two-sample tests and independence test based on copula entropy are conducted on the extracted data to compare the characteristics in two conditions. Comparison results show that gait characteristics, such as gait speed, pace, speed variability, etc., in daily life are different from that of in TUG test. In daily life, people tend to have slow gait speed, smaller pace and speed variability, more frequent stride, and smaller acceleration range than in TUG test. We also found that gait speed, pace, and speed variability have stronger dependence with TUG score in the 3 conditions (TUG, daily life, and both) and that other 5 characteristics have stronger dependence with TUG score in both condition than in each condition. The comparison in this study suggests that TUG and daily life conditions are complementary with each other, and that TUG test can be considered as intervention on the movement state of human. △ Less

Submitted 16 February, 2022; originally announced February 2022.

Comments: 16 pages, 7 figures, 4 tables

arXiv:2202.04324 [pdf]

doi 10.1038/s41593-023-01444-y

Studying the neural representations of uncertainty

Authors: Edgar Y Walker, Stephan Pohl, Rachel N Denison, David L Barack, Jennifer Lee, Ned Block, Wei Ji Ma, Florent Meyniel

Abstract: The study of the brain's representations of uncertainty is a central topic in neuroscience. Unlike most quantities of which the neural representation is studied, uncertainty is a property of an observer's beliefs about the world, which poses specific methodological challenges. We analyze how the literature on the neural representations of uncertainty addresses those challenges and distinguish betw… ▽ More The study of the brain's representations of uncertainty is a central topic in neuroscience. Unlike most quantities of which the neural representation is studied, uncertainty is a property of an observer's beliefs about the world, which poses specific methodological challenges. We analyze how the literature on the neural representations of uncertainty addresses those challenges and distinguish between "code-driven" and "correlational" approaches. Code-driven approaches make assumptions about the neural code for representing world states and the associated uncertainty. By contrast, correlational approaches search for relationships between uncertainty and neural activity without constraints on the neural representation of the world state that this uncertainty accompanies. To compare these two approaches, we apply several criteria for neural representations: sensitivity, specificity, invariance, functionality. Our analysis reveals that the two approaches lead to different, but complementary findings, shaping new research questions and guiding future experiments. △ Less

Submitted 11 October, 2023; v1 submitted 9 February, 2022; originally announced February 2022.

Comments: 23 pages, 3 figures. Nature Neuroscience (2023)

arXiv:2201.04697 [pdf, ps, other]

doi 10.1093/comnet/cnac021

Peak fraction of infected in epidemic spreading for multi-community networks

Authors: Jing Ma, Xiangyi Meng, Lidia A. Braunstein

Abstract: One of the most effective strategies to mitigate the global spreading of a pandemic (e.g., COVID-19) is to shut down international airports. From a network theory perspective, this is since international airports and flights, essentially playing the roles of bridge nodes and bridge links between countries as individual communities, dominate the epidemic spreading characteristics in the whole multi… ▽ More One of the most effective strategies to mitigate the global spreading of a pandemic (e.g., COVID-19) is to shut down international airports. From a network theory perspective, this is since international airports and flights, essentially playing the roles of bridge nodes and bridge links between countries as individual communities, dominate the epidemic spreading characteristics in the whole multi-community system. Among all epidemic characteristics, the peak fraction of infected, $I_{\max}$, is a decisive factor in evaluating an epidemic strategy given limited capacity of medical resources, but is seldom considered in multi-community models. In this paper, we study a general two-community system interconnected by a fraction $r$ of bridge nodes and its dynamic properties, especially $I_{\max}$, under the evolution of the Susceptible-Infected-Recovered (SIR) model. Comparing the characteristic time scales of different parts of the system allows us to analytically derive the asymptotic behavior of $I_{\max}$ with $r$, as $r\rightarrow 0$, which follows different power-law relations in each regime of the phase diagram. We also detect crossovers when $I_{\max}$ changes from one power law to another, crossing different power-law regimes as driven by $r$. Our results enable a better prediction of the effectiveness of strategies acting on bridge nodes, denoted by the power-law exponent $ε_I$ as in $I_{\max}\propto r^{1/ε_I}$. △ Less

Submitted 20 June, 2022; v1 submitted 12 January, 2022; originally announced January 2022.

Comments: 19 pages, 6 figures, 3 tables

Journal ref: Journal of Complex Networks 10 (3), cnac021 (2022)

arXiv:2112.03266 [pdf, other]

Contrastive Cycle Adversarial Autoencoders for Single-cell Multi-omics Alignment and Integration

Authors: Xuesong Wang, Zhihang Hu, Tingyang Yu, Ruijie Wang, Yumeng Wei, Juan Shu, Jianzhu Ma, Yu Li

Abstract: Muilti-modality data are ubiquitous in biology, especially that we have entered the multi-omics era, when we can measure the same biological object (cell) from different aspects (omics) to provide a more comprehensive insight into the cellular system. When dealing with such multi-omics data, the first step is to determine the correspondence among different modalities. In other words, we should mat… ▽ More Muilti-modality data are ubiquitous in biology, especially that we have entered the multi-omics era, when we can measure the same biological object (cell) from different aspects (omics) to provide a more comprehensive insight into the cellular system. When dealing with such multi-omics data, the first step is to determine the correspondence among different modalities. In other words, we should match data from different spaces corresponding to the same object. This problem is particularly challenging in the single-cell multi-omics scenario because such data are very sparse with extremely high dimensions. Secondly, matched single-cell multi-omics data are rare and hard to collect. Furthermore, due to the limitations of the experimental environment, the data are usually highly noisy. To promote the single-cell multi-omics research, we overcome the above challenges, proposing a novel framework to align and integrate single-cell RNA-seq data and single-cell ATAC-seq data. Our approach can efficiently map the above data with high sparsity and noise from different spaces to a low-dimensional manifold in a unified space, making the downstream alignment and integration straightforward. Compared with the other state-of-the-art methods, our method performs better in both simulated and real single-cell data. The proposed method is helpful for the single-cell multi-omics research. The improvement for integration on the simulated data is significant. △ Less

Submitted 13 December, 2021; v1 submitted 5 December, 2021; originally announced December 2021.

arXiv:2110.08471 [pdf, other]

Fast Projection onto the Capped Simplex with Applications to Sparse Regression in Bioinformatics

Authors: Andersen Ang, Jianzhu Ma, Nianjun Liu, Kun Huang, Yijie Wang

Abstract: We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the ex… ▽ More We consider the problem of projecting a vector onto the so-called k-capped simplex, which is a hyper-cube cut by a hyperplane. For an n-dimensional input vector with bounded elements, we found that a simple algorithm based on Newton's method is able to solve the projection problem to high precision with a complexity roughly about O(n), which has a much lower computational cost compared with the existing sorting-based methods proposed in the literature. We provide a theory for partial explanation and justification of the method. We demonstrate that the proposed algorithm can produce a solution of the projection problem with high precision on large scale datasets, and the algorithm is able to significantly outperform the state-of-the-art methods in terms of runtime (about 6-8 times faster than a commercial software with respect to CPU time for input vector with 1 million variables or more). We further illustrate the effectiveness of the proposed algorithm on solving sparse regression in a bioinformatics problem. Empirical results on the GWAS dataset (with 1,500,000 single-nucleotide polymorphisms) show that, when using the proposed method to accelerate the Projected Quasi-Newton (PQN) method, the accelerated PQN algorithm is able to handle huge-scale regression problem and it is more efficient (about 3-6 times faster) than the current state-of-the-art methods. △ Less

Submitted 25 October, 2021; v1 submitted 16 October, 2021; originally announced October 2021.

Comments: 12 pages, 5 figures

arXiv:2103.01606 [pdf]

Time-dependent Clearance of Cyclosporine in Adult Renal Transplant Recipients: A Population Pharmacokinetic Perspective

Authors: Junjun Mao, Xiaoyan Qiu, Weiwei Qin, Luyang Xu, Ming Zhang, Mingkang Zhong

Abstract: Aim The pharmacokinetic (PK) properties of cyclosporine (CsA) in renal transplant recipients are patient- and time-dependent. Knowledge of this time-related variability is necessary to maintain or achieve CsA target exposure. Here, we aimed to identify factors explaining variabilities in CsA PK properties and characterise time-dependent clearance (CL/F) by performing a comprehensive analysis of Cs… ▽ More Aim The pharmacokinetic (PK) properties of cyclosporine (CsA) in renal transplant recipients are patient- and time-dependent. Knowledge of this time-related variability is necessary to maintain or achieve CsA target exposure. Here, we aimed to identify factors explaining variabilities in CsA PK properties and characterise time-dependent clearance (CL/F) by performing a comprehensive analysis of CsA PK factors using population PK (popPK) modelling of long-term follow-up data from our institution. Methods In total, 3,674 whole-blood CsA concentrations from 183 patients who underwent initial renal transplantation were analysed using nonlinear mixed-effects modelling. The effects of potential covariates were selected according to a previous report and well-accepted theoretical mechanisms. Model-informed individualised therapeutic regimens were also conducted. Results A two-compartment model adequately described the data and the estimated mean CsA CL/F was 32.6 L h-1 (5%). Allometrically scaled body size, haematocrit (HCT) level, CGC haplotype carrier status, and postoperative time may contribute to CsA PK variability. The CsA bioavailability in patients receiving a prednisolone dose (PD) of 80 mg was 20.6% lower than that in patients receiving 20 mg. A significant decrease (52.6%) in CL/F was observed as the HCT increased from 10.5% to 60.5%. The CL/F of the non-CGC haplotype carrier was 14.4% lower than that of the CGC haplotype carrier at 3 months post operation. CsA dose adjustments should be considered in different postoperative periods. Conclusions By monitoring body size, HCT, PD, and CGC haplotype, changes in CsA CL/F over time could be predicted. Such information could be used to optimise CsA therapy. △ Less

Submitted 2 March, 2021; originally announced March 2021.

arXiv:2012.05038 [pdf]

Cost-efficiency trade-offs of the human brain network revealed by a multiobjective evolutionary algorithm

Authors: Junji Ma, Jinbo Zhang, Ying Lin, Zhengjia Dai

Abstract: It is widely believed that the formation of brain network structure is under the pressure of optimal trade-off between reducing wiring cost and promoting communication efficiency. However, the question of whether this trade-off exists in empirical human brain networks and, if so, how it takes effect is still not well understood. Here, we employed a multiobjective evolutionary algorithm to directly… ▽ More It is widely believed that the formation of brain network structure is under the pressure of optimal trade-off between reducing wiring cost and promoting communication efficiency. However, the question of whether this trade-off exists in empirical human brain networks and, if so, how it takes effect is still not well understood. Here, we employed a multiobjective evolutionary algorithm to directly and quantitatively explore the cost-efficiency trade-off in human brain networks. Using this algorithm, we generated a population of synthetic networks with optimal but diverse cost-efficiency trade-offs. It was found that these synthetic networks could not only reproduce a large portion of connections in the empirical brain networks but also embed a resembling small-world structure. Moreover, the synthetic and empirical brain networks were found similar in terms of the spatial arrangement of hub regions and the modular structure, which are two important topological features widely assumed to be outcomes of cost-efficiency trade-offs. The synthetic networks had high robustness against random attack as the empirical brain networks did. Additionally, we also revealed some differences of the synthetic networks from the empirical brain networks, including lower segregated processing capacity and weaker robustness against targeted attack. These findings provide direct and quantitative evidence that the structure of human brain networks is indeed largely influenced by optimal cost-efficiency trade-offs. We also suggest that some additional factors (e.g., segregated processing capacity) might jointly determine the network organization with cost and efficiency. △ Less

Submitted 9 December, 2020; originally announced December 2020.

arXiv:2011.11396 [pdf]

THCluster: herb supplements categorization for precision traditional Chinese medicine

Authors: Chunyang Ruan, Ye Wang, Yanchun Zhang, Jiangang Ma, Huijuan Chen, Uwe Aickelin, Shanfeng Zhu, Ting Zhang

Abstract: There has been a continuing demand for traditional and complementary medicine worldwide. A fundamental and important topic in Traditional Chinese Medicine (TCM) is to optimize the prescription and to detect herb regularities from TCM data. In this paper, we propose a novel clustering model to solve this general problem of herb categorization, a pivotal task of prescription optimization and herb re… ▽ More There has been a continuing demand for traditional and complementary medicine worldwide. A fundamental and important topic in Traditional Chinese Medicine (TCM) is to optimize the prescription and to detect herb regularities from TCM data. In this paper, we propose a novel clustering model to solve this general problem of herb categorization, a pivotal task of prescription optimization and herb regularities. The model utilizes Random Walks method, Bayesian rules and Expectation Maximization(EM) models to complete a clustering analysis effectively on a heterogeneous information network. We performed extensive experiments on the real-world datasets and compared our method with other algorithms and experts. Experimental results have demonstrated the effectiveness of the proposed model for discovering useful categorization of herbs and its potential clinical manifestations. △ Less

Submitted 19 November, 2020; originally announced November 2020.

Comments: 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Pages 417-424

arXiv:2006.16648 [pdf, other]

Associations between finger tapping, gait and fall risk with application to fall risk assessment

Authors: Jian Ma

Abstract: As the world ages, elderly care becomes a big concern of the society. To address the elderly's issues on dementia and fall risk, we have investigated smart cognitive and fall risk assessment with machine learning methodology based on the data collected from finger tapping test and Timed Up and Go (TUG) test. Meanwhile, we have discovered the associations between cognition and finger motion from fi… ▽ More As the world ages, elderly care becomes a big concern of the society. To address the elderly's issues on dementia and fall risk, we have investigated smart cognitive and fall risk assessment with machine learning methodology based on the data collected from finger tapping test and Timed Up and Go (TUG) test. Meanwhile, we have discovered the associations between cognition and finger motion from finger tapping data and the association between fall risk and gait characteristics from TUG data. In this paper, we jointly analyze the finger tapping and gait characteristics data with copula entropy. We find that the associations between certain finger tapping characteristics ('number of taps', 'average interval of tapping', 'frequency of tapping' of both hands of bimanual inphase and those of left hand of bimanual untiphase) and TUG score are relatively high. According to this finding, we propose to utilize this associations to improve the predictive models of automatic fall risk assessment we developed previously. Experimental results show that using the characteristics of both finger tapping and gait as inputs of the predictive models of predicting TUG score can considerably improve the prediction performance in terms of MAE compared with using only one type of characteristics. △ Less

Submitted 22 February, 2021; v1 submitted 30 June, 2020; originally announced June 2020.

arXiv:2005.14170 [pdf, other]

doi 10.1103/PhysRevE.102.032308

Role of bridge nodes in epidemic spreading: Different regimes and crossovers

Authors: Jing Ma, Lucas D. Valdez, Lidia A. Braunstein

Abstract: Power-law behaviors are common in many disciplines, especially in network science. Real-world networks, like disease spreading among people, are more likely to be interconnected communities, and show richer power-law behaviors than isolated networks. In this paper, we look at the system of two communities which are connected by bridge links between a fraction $r$ of bridge nodes, and study the eff… ▽ More Power-law behaviors are common in many disciplines, especially in network science. Real-world networks, like disease spreading among people, are more likely to be interconnected communities, and show richer power-law behaviors than isolated networks. In this paper, we look at the system of two communities which are connected by bridge links between a fraction $r$ of bridge nodes, and study the effect of bridge nodes to the final state of the Susceptible-Infected-Recovered model, by mapping it to link percolation. By keeping a fixed average connectivity, but allowing different transmissibilities along internal and bridge links, we theoretically derive different power-law asymptotic behaviors of the total fraction of the recovered $R$ in the final state as $r$ goes to zero, for different combinations of internal and bridge link transmissibilities. We also find crossover points where $R$ follows different power-law behaviors with $r$ on both sides when the internal transmissibility is below but close to its critical value, for different bridge link transmissibilities. All of these power-law behaviors can be explained through different mechanisms of how finite clusters in each community are connected into the giant component of the whole system, and enable us to pick effective epidemic strategies and to better predict their impacts. △ Less

Submitted 28 September, 2020; v1 submitted 28 May, 2020; originally announced May 2020.

Journal ref: Phys. Rev. E 102, 032308 (2020)

arXiv:2005.02181 [pdf, other]

A neural network walks into a lab: towards using deep nets as models for human behavior

Authors: Wei Ji Ma, Benjamin Peters

Abstract: What might sound like the beginning of a joke has become an attractive prospect for many cognitive scientists: the use of deep neural network models (DNNs) as models of human behavior in perceptual and cognitive tasks. Although DNNs have taken over machine learning, attempts to use them as models of human behavior are still in the early stages. Can they become a versatile model class in the cognit… ▽ More What might sound like the beginning of a joke has become an attractive prospect for many cognitive scientists: the use of deep neural network models (DNNs) as models of human behavior in perceptual and cognitive tasks. Although DNNs have taken over machine learning, attempts to use them as models of human behavior are still in the early stages. Can they become a versatile model class in the cognitive scientist's toolbox? We first argue why DNNs have the potential to be interesting models of human behavior. We then discuss how that potential can be more fully realized. On the one hand, we argue that the cycle of training, testing, and revising DNNs needs to be revisited through the lens of the cognitive scientist's goals. Specifically, we argue that methods for assessing the goodness of fit between DNN models and human behavior have to date been impoverished. On the other hand, cognitive science might have to start using more complex tasks (including richer stimulus spaces), but doing so might be beneficial for DNN-independent reasons as well. Finally, we highlight avenues where traditional cognitive process models and DNNs may show productive synergy. △ Less

Submitted 2 May, 2020; originally announced May 2020.

arXiv:2004.13167 [pdf, other]

Energy-based models for atomic-resolution protein conformations

Authors: Yilun Du, Joshua Meier, Jerry Ma, Rob Fergus, Alexander Rives

Abstract: We propose an energy-based model (EBM) of protein conformations that operates at atomic scale. The model is trained solely on crystallized protein data. By contrast, existing approaches for scoring conformations use energy functions that incorporate knowledge of physical principles and features that are the complex product of several decades of research and tuning. To evaluate the model, we benchm… ▽ More We propose an energy-based model (EBM) of protein conformations that operates at atomic scale. The model is trained solely on crystallized protein data. By contrast, existing approaches for scoring conformations use energy functions that incorporate knowledge of physical principles and features that are the complex product of several decades of research and tuning. To evaluate the model, we benchmark on the rotamer recovery task, the problem of predicting the conformation of a side chain from its context within a protein structure, which has been used to evaluate energy functions for protein design. The model achieves performance close to that of the Rosetta energy function, a state-of-the-art method widely used in protein structure prediction and design. An investigation of the model's outputs and hidden representations finds that it captures physicochemical properties relevant to protein energy. △ Less

Submitted 27 April, 2020; originally announced April 2020.

Comments: Accepted to ICLR 2020

Journal ref: International Conference on Learning Representations (ICLR), 2020

arXiv:2004.11841 [pdf, other]

Risk Estimation of SARS-CoV-2 Transmission from Bluetooth Low Energy Measurements

Authors: Felix Sattler, Jackie Ma, Patrick Wagner, David Neumann, Markus Wenzel, Ralf Schäfer, Wojciech Samek, Klaus-Robert Müller, Thomas Wiegand

Abstract: Digital contact tracing approaches based on Bluetooth low energy (BLE) have the potential to efficiently contain and delay outbreaks of infectious diseases such as the ongoing SARS-CoV-2 pandemic. In this work we propose a novel machine learning based approach to reliably detect subjects that have spent enough time in close proximity to be at risk of being infected. Our study is an important proof… ▽ More Digital contact tracing approaches based on Bluetooth low energy (BLE) have the potential to efficiently contain and delay outbreaks of infectious diseases such as the ongoing SARS-CoV-2 pandemic. In this work we propose a novel machine learning based approach to reliably detect subjects that have spent enough time in close proximity to be at risk of being infected. Our study is an important proof of concept that will aid the battery of epidemiological policies aiming to slow down the rapid spread of COVID-19. △ Less

Submitted 22 April, 2020; originally announced April 2020.

arXiv:2004.08730 [pdf, other]

doi 10.1007/978-981-16-6372-7_34

Predicting MMSE Score from Finger-Tapping Measurement

Authors: Jian Ma

Abstract: Dementia is a leading cause of diseases for the elderly. Early diagnosis is very important for the elderly living with dementias. In this paper, we propose a method for dementia diagnosis by predicting MMSE score from finger-tapping measurement with machine learning pipeline. Based on measurement of finger tapping movement, the pipeline is first to select finger-tapping attributes with copula entr… ▽ More Dementia is a leading cause of diseases for the elderly. Early diagnosis is very important for the elderly living with dementias. In this paper, we propose a method for dementia diagnosis by predicting MMSE score from finger-tapping measurement with machine learning pipeline. Based on measurement of finger tapping movement, the pipeline is first to select finger-tapping attributes with copula entropy and then to predict MMSE score from the selected attributes with predictive models. Experiments on real world data show that the predictive models such developed present good prediction performance. As a byproduct, the associations between certain finger-tapping attributes ('Number of taps', 'Average of intervals', and 'Frequency of taps' of both hands of bimanual in-phase task) and MMSE score are discovered with copula entropy, which may be interpreted as the biological relationship between cognitive ability and motor ability and therefore makes the predictive models explainable. The selected finger-tapping attributes can be considered as dementia biomarkers. △ Less

Submitted 15 November, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

Comments: 11 pages, 4 figures, 2 tables

Journal ref: Proceedings of 2021 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 801

arXiv:2003.00875 [pdf]

Predicting TUG score from gait characteristics with video analysis and machine learning

Authors: Jian Ma

Abstract: Fall is a leading cause of death which suffers the elderly and society. Timed Up and Go (TUG) test is a common tool for fall risk assessment. In this paper, we propose a method for predicting TUG score from gait characteristics extracted from video with computer vision and machine learning technologies. First, 3D pose is estimated from video captured with 2D and 3D cameras during human motion and… ▽ More Fall is a leading cause of death which suffers the elderly and society. Timed Up and Go (TUG) test is a common tool for fall risk assessment. In this paper, we propose a method for predicting TUG score from gait characteristics extracted from video with computer vision and machine learning technologies. First, 3D pose is estimated from video captured with 2D and 3D cameras during human motion and then a group of gait characteristics are computed from 3D pose series. After that, copula entropy is used to select those characteristics which are mostly associated with TUG score. Finally, the selected characteristics are fed into the predictive models to predict TUG score. Experiments on real world data demonstrated the effectiveness of the proposed method. As a byproduct, the associations between TUG score and several gait characteristics are discovered, which laid the scientific foundation of the proposed method and make the predictive models such built interpretable to clinical users. △ Less

Submitted 28 April, 2020; v1 submitted 23 February, 2020; originally announced March 2020.

Comments: Experimental results and discussion are revised. The code for estimating copula entropy is available at https://github.com/majianthu/copent

arXiv:2001.03985 [pdf, other]

doi 10.1371/journal.pcbi.1008483

Unbiased and Efficient Log-Likelihood Estimation with Inverse Binomial Sampling

Authors: Bas van Opheusden, Luigi Acerbi, Wei Ji Ma

Abstract: The fate of scientific hypotheses often relies on the ability of a computational model to explain the data, quantified in modern statistical approaches by the likelihood function. The log-likelihood is the key element for parameter estimation and model evaluation. However, the log-likelihood of complex models in fields such as computational biology and neuroscience is often intractable to compute… ▽ More The fate of scientific hypotheses often relies on the ability of a computational model to explain the data, quantified in modern statistical approaches by the likelihood function. The log-likelihood is the key element for parameter estimation and model evaluation. However, the log-likelihood of complex models in fields such as computational biology and neuroscience is often intractable to compute analytically or numerically. In those cases, researchers can often only estimate the log-likelihood by comparing observed data with synthetic observations generated by model simulations. Standard techniques to approximate the likelihood via simulation either use summary statistics of the data or are at risk of producing severe biases in the estimate. Here, we explore another method, inverse binomial sampling (IBS), which can estimate the log-likelihood of an entire data set efficiently and without bias. For each observation, IBS draws samples from the simulator model until one matches the observation. The log-likelihood estimate is then a function of the number of samples drawn. The variance of this estimator is uniformly bounded, achieves the minimum variance for an unbiased estimator, and we can compute calibrated estimates of the variance. We provide theoretical arguments in favor of IBS and an empirical assessment of the method for maximum-likelihood estimation with simulation-based models. As case studies, we take three model-fitting problems of increasing complexity from computational and cognitive neuroscience. In all problems, IBS generally produces lower error in the estimated parameters and maximum log-likelihood values than alternative sampling methods with the same average number of samples. Our results demonstrate the potential of IBS as a practical, robust, and easy to implement method for log-likelihood evaluation when exact techniques are not available. △ Less

Submitted 27 October, 2020; v1 submitted 12 January, 2020; originally announced January 2020.

Comments: Bas van Opheusden and Luigi Acerbi contributed equally to this work

arXiv:2001.00692 [pdf]

FFusionCGAN: An end-to-end fusion method for few-focus images using conditional GAN in cytopathological digital slides

Authors: Xiebo Geng, Sibo Liua, Wei Han, Xu Li, Jiabo Ma, Jingya Yu, Xiuli Liu, Sahoqun Zeng, Li Chen, Shenghua Cheng

Abstract: Multi-focus image fusion technologies compress different focus depth images into an image in which most objects are in focus. However, although existing image fusion techniques, including traditional algorithms and deep learning-based algorithms, can generate high-quality fused images, they need multiple images with different focus depths in the same field of view. This criterion may not be met in… ▽ More Multi-focus image fusion technologies compress different focus depth images into an image in which most objects are in focus. However, although existing image fusion techniques, including traditional algorithms and deep learning-based algorithms, can generate high-quality fused images, they need multiple images with different focus depths in the same field of view. This criterion may not be met in some cases where time efficiency is required or the hardware is insufficient. The problem is especially prominent in large-size whole slide images. This paper focused on the multi-focus image fusion of cytopathological digital slide images, and proposed a novel method for generating fused images from single-focus or few-focus images based on conditional generative adversarial network (GAN). Through the adversarial learning of the generator and discriminator, the method is capable of generating fused images with clear textures and large depth of field. Combined with the characteristics of cytopathological images, this paper designs a new generator architecture combining U-Net and DenseBlock, which can effectively improve the network's receptive field and comprehensively encode image features. Meanwhile, this paper develops a semantic segmentation network that identifies the blurred regions in cytopathological images. By integrating the network into the generative model, the quality of the generated fused images is effectively improved. Our method can generate fused images from only single-focus or few-focus images, thereby avoiding the problem of collecting multiple images of different focus depths with increased time and hardware costs. Furthermore, our model is designed to learn the direct mapping of input source images to fused images without the need to manually design complex activity level measurements and fusion rules as in traditional methods. △ Less

Submitted 2 January, 2020; originally announced January 2020.

arXiv:1912.01769 [pdf]

doi 10.1016/j.ejps.2019.105199

Population Pharmacokinetic Study of Tacrolimus in Pediatric Patients with Primary Nephrotic Syndrome: A Comparison of Linear and Nonlinear Michaelis Menten Pharmacokinetic Model

Authors: Lingfei Huang, Yixi Liu, Zheng Jiao, Junyan Wang, Luo Fang, Jianhua Mao

Abstract: Background Little is known about the population pharmacokinetics (PPK) of tacrolimus (TAC) in pediatric primary nephrotic syndrome (PNS). This study aimed to compare the predictive performance between nonlinear and linear PK models and investigate the significant factors of TAC PK characteristics in pediatric PNS. Methods Data were obtained from 71 pediatric patients with PNS, along with 525 TAC t… ▽ More Background Little is known about the population pharmacokinetics (PPK) of tacrolimus (TAC) in pediatric primary nephrotic syndrome (PNS). This study aimed to compare the predictive performance between nonlinear and linear PK models and investigate the significant factors of TAC PK characteristics in pediatric PNS. Methods Data were obtained from 71 pediatric patients with PNS, along with 525 TAC trough concentrations at steady state. The demographic, medical, and treatment details were collected. Genetic polymorphisms were analyzed. The PPK models were developed using nonlinear mixed effects model software. Two modeling strategies, linear compartmental and nonlinear Michaelis Menten (MM) models, were evaluated and compared. Results Body weight, age, daily dose of TAC, co-therapy drugs (including azole antifungal agents and diltiazem), and CYP3A5*3 genotype were important factors in the final linear model (onecompartment model), whereas only body weight, codrugs, and CYP3A5*3 genotype were the important factors in the nonlinear MM model. Apparent clearance and volume of distribution in the final linear model were 7.13 L/h and 142 L, respectively. The maximal dose rate (Vmax) of the nonlinear MM model was 1.92 mg/day and the average concentration at steady state at half-Vmax (Km) was 1.98 ng/mL. The nonlinear model described the data better than the linear model. Dosing regimens were proposed based on the nonlinear PK model.Conclusion Our findings demonstrate that the nonlinear MM model showed better predictive performance than the linear compartmental model, providing reliable support for optimizing TAC dosing and adjustment in children with PNS. △ Less

Submitted 26 February, 2020; v1 submitted 3 December, 2019; originally announced December 2019.

Comments: 22 pages, 4 tables and 4 figures

Journal ref: Eur J Pharm Sci. 2020 Feb 15;143:105199

arXiv:1907.12268 [pdf, other]

Discovering Association with Copula Entropy

Authors: Jian Ma

Abstract: Discovering associations is of central importance in scientific practices. Currently, most researches consider only linear association measured by correlation coefficient, which has its theoretical limitations. In this paper, we propose a new method for discovering association with copula entropy -- a universal applicable association measure for not only linear cases, but nonlinear cases. The adva… ▽ More Discovering associations is of central importance in scientific practices. Currently, most researches consider only linear association measured by correlation coefficient, which has its theoretical limitations. In this paper, we propose a new method for discovering association with copula entropy -- a universal applicable association measure for not only linear cases, but nonlinear cases. The advantage of the method based on copula entropy over traditional method is demonstrated on the NHANES data by discovering more biomedical meaningful associations. △ Less

Submitted 14 April, 2020; v1 submitted 29 July, 2019; originally announced July 2019.

Comments: Minor revision. The code is available at https://github.com/majianthu/copent

Showing 1–50 of 72 results for author: Ma, J