Search | arXiv e-print repository

3D MR Fingerprinting for Dynamic Contrast-Enhanced Imaging of Whole Mouse Brain

Authors: Yuran Zhu, Guanhua Wang, Yuning Gu, Walter Zhao, Jiahao Lu, Junqing Zhu, Christina J. MacAskill, Andrew Dupuis, Mark A. Griswold, Dan Ma, Chris A. Flask, Xin Yu

Abstract: Quantitative MRI enables direct quantification of contrast agent concentrations in contrast-enhanced scans. However, the lengthy scan times required by conventional methods are inadequate for tracking contrast agent transport dynamically in mouse brain. We developed a 3D MR fingerprinting (MRF) method for simultaneous T1 and T2 mapping across the whole mouse brain with 4.3-min temporal resolution. We designed a 3D MRF sequence with variable acquisition segment lengths and magnetization preparations on a 9.4T preclinical MRI scanner. Model-based reconstruction approaches were employed to improve the accuracy and speed of MRF acquisition. The method's accuracy for T1 and T2 measurements was validated in vitro, while its repeatability of T1 and T2 measurements was evaluated in vivo (n=3). The utility of the 3D MRF sequence for dynamic tracking of intracisternally infused Gd-DTPA in the whole mouse brain was demonstrated (n=5). Phantom studies confirmed accurate T1 and T2 measurements by 3D MRF with an undersampling factor up to 48. Dynamic contrast-enhanced (DCE) MRF scans achieved a spatial resolution of 192 x 192 x 500 um3 and a temporal resolution of 4.3 min, allowing for the analysis and comparison of dynamic changes in concentration and transport kinetics of intracisternally infused Gd-DTPA across brain regions. The sequence also enabled highly repeatable, high-resolution T1 and T2 mapping of the whole mouse brain (192 x 192 x 250 um3) in 30 min. We present the first dynamic and multi-parametric approach for quantitatively tracking contrast agent transport in the mouse brain using 3D MRF.

Submitted 1 May, 2024; originally announced May 2024.

arXiv:2403.12284 [pdf, other]

The Wreaths of KHAN: Uniform Graph Feature Selection with False Discovery Rate Control

Authors: Jiajun Liang, Yue Liu, Doudou Zhou, Sinian Zhang, Junwei Lu

Abstract: Graphical models find numerous applications in biology, chemistry, sociology, neuroscience, etc. While substantial progress has been made in graph estimation, it remains largely unexplored how to select significant graph signals with uncertainty assessment, especially those graph features related to topological structures including cycles (i.e., wreaths), cliques, hubs, etc. These features play a vital role in protein substructure analysis, drug molecular design, and brain network connectivity analysis. To fill the gap, we propose a novel inferential framework for general high dimensional graphical models to select graph features with false discovery rate controlled. Our method is based on the maximum of $p$-values from single edges that comprise the topological feature of interest, thus is able to detect weak signals. Moreover, we introduce the $K$-dimensional persistent Homology Adaptive selectioN (KHAN) algorithm to select all the homological features within $K$ dimensions with the uniform control of the false discovery rate over continuous filtration levels. The KHAN method applies a novel discrete Gram-Schmidt algorithm to select statistically significant generators from the homology group. We apply the structural screening method to identify the important residues of the SARS-CoV-2 spike protein during the binding process to the ACE2 receptors. We score the residues for all domains in the spike protein by the $p$-value weighted filtration level in the network persistent homology for the closed, partially open, and open states and identify the residues crucial for protein conformational changes and thus being potential targets for inhibition.

Submitted 18 March, 2024; originally announced March 2024.

arXiv:2403.03274 [pdf, other]

From Noise to Signal: Unveiling Treatment Effects from Digital Health Data through Pharmacology-Informed Neural-SDE

Authors: Samira Pakravan, Nikolaos Evangelou, Maxime Usdin, Logan Brooks, James Lu

Abstract: Digital health technologies (DHT), such as wearable devices, provide personalized, continuous, and real-time monitoring of patients. These technologies are contributing to the development of novel therapies and personalized medicine. Gaining insight from these technologies requires appropriate modeling techniques to capture clinically-relevant changes in disease state. The data generated from these devices is characterized by being stochastic in nature, may have missing elements, and exhibits considerable inter-individual variability - thereby making it difficult to analyze using traditional longitudinal modeling techniques. We present a novel pharmacology-informed neural stochastic differential equation (SDE) model capable of addressing these challenges. Using synthetic data, we demonstrate that our approach is effective in identifying treatment effects and learning causal relationships from stochastic data, thereby enabling counterfactual simulation.

Submitted 5 March, 2024; originally announced March 2024.

Comments: 6 figures

ACM Class: I.2; G.3

arXiv:2402.12391 [pdf, other]

Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data

Authors: Haoyang Liu, Yijiang Li, Jinglin Jian, Yuxuan Cheng, Jianrong Lu, Shuyi Guo, Jinglei Zhu, Mianchen Zhang, Miantong Zhang, Haohan Wang

Abstract: Machine learning has emerged as a powerful tool for scientific discovery, enabling researchers to extract meaningful insights from complex datasets. For instance, it has facilitated the identification of disease-predictive genes from gene expression data, significantly advancing healthcare. However, the traditional process for analyzing such datasets demands substantial human effort and expertise for the data selection, processing, and analysis. To address this challenge, we introduce a novel framework, a Team of AI-made Scientists (TAIS), designed to streamline the scientific discovery pipeline. TAIS comprises simulated roles, including a project manager, data engineer, and domain expert, each represented by a Large Language Model (LLM). These roles collaborate to replicate the tasks typically performed by data scientists, with a specific focus on identifying disease-predictive genes. Furthermore, we have curated a benchmark dataset to assess TAIS's effectiveness in gene identification, demonstrating our system's potential to significantly enhance the efficiency and scope of scientific exploration. Our findings represent a solid step towards automating scientific discovery through large language models.

Submitted 20 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: 18 pages, 2 figures; added contact

arXiv:2402.10433 [pdf, other]

Fusing Neural and Physical: Augment Protein Conformation Sampling with Tractable Simulations

Authors: Jiarui Lu, Zuobai Zhang, Bozitao Zhong, Chence Shi, Jian Tang

Abstract: The protein dynamics are common and important for their biological functions and properties, the study of which usually involves time-consuming molecular dynamics (MD) simulations in silico. Recently, generative models have been leveraged as surrogate samplers to obtain conformation ensembles orders of magnitude faster and without requiring any simulation data (a "zero-shot" inference). However, being agnostic of the underlying energy landscape, the accuracy of such generative models may still be limited. In this work, we explore the few-shot setting of such a pre-trained generative sampler, which incorporates MD simulations in a tractable manner. Specifically, given a target protein of interest, we first acquire some seeding conformations from the pre-trained sampler followed by a number of physical simulations in parallel starting from these seeding samples. Then we fine-tuned the generative model using the simulation trajectories above to become a target-specific sampler. Experimental results demonstrated the superior performance of such a few-shot conformation sampler at a tractable computational cost.

Submitted 11 March, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

Comments: Published at the GEM workshop, ICLR 2024

arXiv:2402.07955 [pdf, other]

ProtIR: Iterative Refinement between Retrievers and Predictors for Protein Function Annotation

Authors: Zuobai Zhang, Jiarui Lu, Vijil Chenthamarakshan, Aurélie Lozano, Payel Das, Jian Tang

Abstract: Protein function annotation is an important yet challenging task in biology. Recent deep learning advancements show significant potential for accurate function prediction by learning from protein sequences and structures. Nevertheless, these predictor-based methods often overlook the modeling of protein similarity, an idea commonly employed in traditional approaches using sequence or structure retrieval tools. To fill this gap, we first study the effect of inter-protein similarity modeling by benchmarking retriever-based methods against predictors on protein function annotation tasks. Our results show that retrievers can match or outperform predictors without large-scale pre-training. Building on these insights, we introduce a novel variational pseudo-likelihood framework, ProtIR, designed to improve function predictors by incorporating inter-protein similarity modeling. This framework iteratively refines knowledge between a function predictor and retriever, thereby combining the strengths of both predictors and retrievers. ProtIR showcases around 10% improvement over vanilla predictor-based methods. Besides, it achieves performance on par with protein language model-based methods, yet without the need for massive pre-training, highlighting the efficacy of our framework. Code will be released upon acceptance.

Submitted 10 February, 2024; originally announced February 2024.

arXiv:2402.05856 [pdf, other]

Structure-Informed Protein Language Model

Authors: Zuobai Zhang, Jiarui Lu, Vijil Chenthamarakshan, Aurélie Lozano, Payel Das, Jian Tang

Abstract: Protein language models are a powerful tool for learning protein representations through pre-training on vast protein sequence datasets. However, traditional protein language models lack explicit structural supervision, despite its relevance to protein function. To address this issue, we introduce the integration of remote homology detection to distill structural information into protein language models without requiring explicit protein structures as input. We evaluate the impact of this structure-informed training on downstream protein function prediction tasks. Experimental results reveal consistent improvements in function annotation accuracy for EC number and GO term prediction. Performance on mutant datasets, however, varies based on the relationship between targeted properties and protein structures. This underscores the importance of considering this relationship when applying structure-aware training to protein function prediction tasks. Code and model weights are available at https://github.com/DeepGraphLearning/esm-s.

Submitted 7 February, 2024; originally announced February 2024.

arXiv:2402.00024 [pdf, other]

Can Large Language Models Understand Molecules?

Authors: Shaghayegh Sadeghi, Alan Bui, Ali Forooghi, Jianguo Lu, Alioune Ngom

Abstract: Purpose: Large Language Models (LLMs) like GPT (Generative Pre-trained Transformer) from OpenAI and LLaMA (Large Language Model Meta AI) from Meta AI are increasingly recognized for their potential in the field of cheminformatics, particularly in understanding Simplified Molecular Input Line Entry System (SMILES), a standard method for representing chemical structures. These LLMs also have the ability to decode SMILES strings into vector representations. Method: We investigate the performance of GPT and LLaMA compared to pre-trained models on SMILES in embedding SMILES strings on downstream tasks, focusing on two key applications: molecular property prediction and drug-drug interaction prediction. Results: We find that SMILES embeddings generated using LLaMA outperform those from GPT in both molecular property and DDI prediction tasks. Notably, LLaMA-based SMILES embeddings show results comparable to pre-trained models on SMILES in molecular prediction tasks and outperform the pre-trained models for the DDI prediction tasks. Conclusion: The performance of LLMs in generating SMILES embeddings shows great potential for further investigation of these models for molecular embedding. We hope our study bridges the gap between LLMs and molecular embedding, motivating additional research into the potential of LLMs in the molecular representation field. GitHub: https://github.com/sshaghayeghs/LLaMA-VS-GPT

Submitted 20 May, 2024; v1 submitted 5 January, 2024; originally announced February 2024.

arXiv:2401.17123 [pdf, other]

Unsupervised Discovery of Steerable Factors When Graph Deep Generative Models Are Entangled

Authors: Shengchao Liu, Chengpeng Wang, Jiarui Lu, Weili Nie, Hanchen Wang, Zhuoxinran Li, Bolei Zhou, Jian Tang

Abstract: Deep generative models (DGMs) have been widely developed for graph data. However, much less investigation has been carried out on understanding the latent space of such pretrained graph DGMs. These understandings possess the potential to provide constructive guidelines for crucial tasks, such as graph controllable generation. Thus in this work, we are interested in studying this problem and propose GraphCG, a method for the unsupervised discovery of steerable factors in the latent space of pretrained graph DGMs. We first examine the representation space of three pretrained graph DGMs with six disentanglement metrics, and we observe that the pretrained representation space is entangled. Motivated by this observation, GraphCG learns the steerable factors via maximizing the mutual information between semantic-rich directions, where the controlled graph moving along the same direction will share the same steerable factors. We quantitatively verify that GraphCG outperforms four competitive baselines on two graph DGMs pretrained on two molecule datasets. Additionally, we qualitatively illustrate seven steerable factors learned by GraphCG on five pretrained DGMs over five graph datasets, including two for molecules and three for point clouds.

Submitted 29 January, 2024; originally announced January 2024.

arXiv:2308.01362 [pdf]

Explainable Deep Learning for Tumor Dynamic Modeling and Overall Survival Prediction using Neural-ODE

Authors: Mark Laurie, James Lu

Abstract: While tumor dynamic modeling has been widely applied to support the development of oncology drugs, there remains a need to increase predictivity, enable personalized therapy, and improve decision-making. We propose the use of Tumor Dynamic Neural-ODE (TDNODE) as a pharmacology-informed neural network to enable model discovery from longitudinal tumor size data. We show that TDNODE overcomes a key limitation of existing models in its ability to make unbiased predictions from truncated data. The encoder-decoder architecture is designed to express an underlying dynamical law which possesses the fundamental property of generalized homogeneity with respect to time. Thus, the modeling formalism enables the encoder output to be interpreted as kinetic rate metrics, with inverse time as the physical unit. We show that the generated metrics can be used to predict patients' overall survival (OS) with high accuracy. The proposed modeling formalism provides a principled way to integrate multimodal dynamical datasets in oncology disease modeling.

Submitted 20 October, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

Comments: 33 pages, 4 Figures and 2 Tables. Includes Supplementary Materials

MSC Class: 92-10 ACM Class: I.2.6

arXiv:2306.03117 [pdf, other]

Str2Str: A Score-based Framework for Zero-shot Protein Conformation Sampling

Authors: Jiarui Lu, Bozitao Zhong, Zuobai Zhang, Jian Tang

Abstract: The dynamic nature of proteins is crucial for determining their biological functions and properties, for which Monte Carlo (MC) and molecular dynamics (MD) simulations stand as predominant tools to study such phenomena. By utilizing empirically derived force fields, MC or MD simulations explore the conformational space through numerically evolving the system via Markov chain or Newtonian mechanics. However, the high-energy barrier of the force fields can hamper the exploration of both methods by the rare event, resulting in an inadequately sampled ensemble without exhaustive running. Existing learning-based approaches perform direct sampling yet heavily rely on target-specific simulation data for training, which suffers from high data acquisition cost and poor generalizability. Inspired by simulated annealing, we propose Str2Str, a novel structure-to-structure translation framework capable of zero-shot conformation sampling with roto-translation equivariant property. Our method leverages an amortized denoising score matching objective trained on general crystal structures and has no reliance on simulation data during both training and inference. Experimental results across several benchmarking protein systems demonstrate that Str2Str outperforms previous state-of-the-art generative structure prediction models and can be orders of magnitude faster compared to long MD simulations. Our open-source implementation is available at https://github.com/lujiarui/Str2Str

Submitted 11 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: Published as a conference paper at ICLR 2024, see https://openreview.net/forum?id=C4BikKsgmK

arXiv:2305.12617 [pdf]

Energy landscape reveals the underlying mechanism of cancer-adipose conversion with gene network models

Authors: Zihao Chen, Jia Lu, Xing-Ming Zhao, Haiyang Yu, Chunhe Li

Abstract: Cancer is a systemic heterogeneous disease involving complex molecular networks. Tumor formation involves epithelial-mesenchymal transition (EMT), which promotes both metastasis and plasticity of cancer cells. Recent experiments proposed that cancer cells can be transformed into adipocytes with combination drugs. However, the underlying mechanisms for how these drugs work from a molecular network perspective remain elusive. To reveal the mechanism of cancer-adipose conversion (CAC), we adopt a systems biology approach by combining mathematical modeling and molecular experiments based on the underlying molecular regulatory network. We identified four types of attractors which correspond to epithelial (E), mesenchymal (M), adipose (A) and partial/intermediate EMT (P) cell states on the CAC landscape. Landscape and transition path results illustrate that the intermediate states play critical roles in cancer to adipose transition. Through a landscape control strategy, we identified two new therapeutic strategies for drug combinations to promote CAC. We further verified these predictions by molecular experiments in different cell lines. Our combined computational and experimental approach provides a powerful tool to explore molecular mechanisms for cell fate transitions in cancer networks. Our results revealed the underlying mechanism for intermediate cell states governing the CAC, and identified new potential drug combinations to induce cancer adipogenesis.

Submitted 21 May, 2023; originally announced May 2023.

Comments: 35 pages, 5 figures

arXiv:2304.00687 [pdf]

Computational Validation of a Mathematical Model of Stable Multi-Species Communities in a Hawk Dove Game

Authors: Jeffrey Lu

Abstract: We revisit the original hawk-dove game with slight modifications to payoff values while maintaining the fundamental principles of interaction. The practical robustness of the theoretical tools of game theory is tested on a simulated population of hawks and doves with varying initial population distributions and peak growth rates. Additionally, we aim to find conditions in which the entire community fails or becomes a single-species population. The results show that the predicted community distribution is established by the majority of communities but fails to exist in communities with extreme initial imbalances in species distribution and insufficient growth rates. We also find that greater growth rates can compensate for more imbalanced initial conditions and that more balanced initial conditions can compensate for lower growth rates. Overall, the simple theoretical model is a strong predictor of the stable behavior of simulated multi-species communities.

Submitted 2 April, 2023; originally announced April 2023.

MSC Class: 91A22

arXiv:2302.04611 [pdf, other]

A Text-guided Protein Design Framework

Authors: Shengchao Liu, Yanjing Li, Zhuoxinran Li, Anthony Gitter, Yutao Zhu, Jiarui Lu, Zhao Xu, Weili Nie, Arvind Ramanathan, Chaowei Xiao, Jian Tang, Hongyu Guo, Anima Anandkumar

Abstract: Current AI-assisted protein design mainly utilizes protein sequential and structural information. Meanwhile, there exists tremendous knowledge curated by humans in the text format describing proteins' high-level functionalities. Yet, whether the incorporation of such text data can help protein design tasks has not been explored. To bridge this gap, we propose ProteinDT, a multi-modal framework that leverages textual descriptions for protein design. ProteinDT consists of three subsequent steps: ProteinCLAP which aligns the representation of two modalities, a facilitator that generates the protein representation from the text modality, and a decoder that creates the protein sequences from the representation. To train ProteinDT, we construct a large dataset, SwissProtCLAP, with 441K text and protein pairs. We quantitatively verify the effectiveness of ProteinDT on three challenging tasks: (1) over 90% accuracy for text-guided protein generation; (2) best hit ratio on 10 zero-shot text-guided protein editing tasks; (3) superior performance on four out of six protein property prediction benchmarks.

Submitted 3 December, 2023; v1 submitted 9 February, 2023; originally announced February 2023.

arXiv:2212.10789 [pdf, other]

Multi-modal Molecule Structure-text Model for Text-based Retrieval and Editing

Authors: Shengchao Liu, Weili Nie, Chengpeng Wang, Jiarui Lu, Zhuoran Qiao, Ling Liu, Jian Tang, Chaowei Xiao, Anima Anandkumar

Abstract: There is increasing adoption of artificial intelligence in drug discovery. However, existing studies use machine learning to mainly utilize the chemical structures of molecules but ignore the vast textual knowledge available in chemistry. Incorporating textual knowledge enables us to realize new drug design objectives, adapt to text-based instructions and predict complex biological activities. Here we present a multi-modal molecule structure-text model, MoleculeSTM, by jointly learning molecules' chemical structures and textual descriptions via a contrastive learning strategy. To train MoleculeSTM, we construct a large multi-modal dataset, namely, PubChemSTM, with over 280,000 chemical structure-text pairs. To demonstrate the effectiveness and utility of MoleculeSTM, we design two challenging zero-shot tasks based on text instructions, including structure-text retrieval and molecule editing. MoleculeSTM has two main properties: open vocabulary and compositionality via natural language. In experiments, MoleculeSTM obtains the state-of-the-art generalization ability to novel biochemical concepts across various benchmarks.

Submitted 29 January, 2024; v1 submitted 21 December, 2022; originally announced December 2022.

arXiv:2212.00555 [pdf, other]

A Structure-guided Effective and Temporal-lag Connectivity Network for Revealing Brain Disorder Mechanisms

Authors: Zhengwang Xia, Tao Zhou, Saqib Mamoon, Amani Alfakih, Jianfeng Lu

Abstract: Brain network provides important insights for the diagnosis of many brain disorders, and how to effectively model the brain structure has become one of the core issues in the domain of brain imaging analysis. Recently, various computational methods have been proposed to estimate the causal relationship (i.e., effective connectivity) between brain regions. Compared with traditional correlation-based methods, effective connectivity can provide the direction of information flow, which may provide additional information for the diagnosis of brain diseases. However, existing methods either ignore the fact that there is a temporal-lag in the information transmission across brain regions, or simply set the temporal-lag value between all brain regions to a fixed value. To overcome these issues, we design an effective temporal-lag neural network (termed ETLN) to simultaneously infer the causal relationships and the temporal-lag values between brain regions, which can be trained in an end-to-end manner. In addition, we also introduce three mechanisms to better guide the modeling of brain networks. The evaluation results on the Alzheimer's Disease Neuroimaging Initiative (ADNI) database demonstrate the effectiveness of the proposed method.

Submitted 1 December, 2022; originally announced December 2022.

arXiv:2210.08761 [pdf, other]

Protein Sequence and Structure Co-Design with Equivariant Translation

Authors: Chence Shi, Chuanrui Wang, Jiarui Lu, Bozitao Zhong, Jian Tang

Abstract: Proteins are macromolecules that perform essential functions in all living organisms. Designing novel proteins with specific structures and desired functions has been a long-standing challenge in the field of bioengineering. Existing approaches generate both protein sequence and structure using either autoregressive models or diffusion models, both of which suffer from high inference costs. In this paper, we propose a new approach capable of protein sequence and structure co-design, which iteratively translates both protein sequence and structure into the desired state from random initialization, based on context features given a priori. Our model consists of a trigonometry-aware encoder that reasons geometrical constraints and interactions from context features, and a roto-translation equivariant decoder that translates protein sequence and structure interdependently. Notably, all protein amino acids are updated in one shot in each translation step, which significantly accelerates the inference process. Experimental results across multiple tasks show that our model outperforms previous state-of-the-art baselines by a large margin, and is able to design proteins of high fidelity as regards both sequence and structure, with running time orders of magnitude less than sampling-based methods.

Submitted 2 March, 2023; v1 submitted 17 October, 2022; originally announced October 2022.

Comments: Published as a conference paper at ICLR 2023, see https://openreview.net/forum?id=pRCMXcfdihq

arXiv:2202.01975 [pdf]

Performance of multilabel machine learning models and risk stratification schemas for predicting stroke and bleeding risk in patients with non-valvular atrial fibrillation

Authors: Juan Lu, Rebecca Hutchens, Joseph Hung, Mohammed Bennamoun, Brendan McQuillan, Tom Briffa, Ferdous Sohel, Kevin Murray, Jonathon Stewart, Benjamin Chow, Frank Sanfilippo, Girish Dwivedi

Abstract: Appropriate antithrombotic therapy for patients with atrial fibrillation (AF) requires assessment of ischemic stroke and bleeding risks. However, risk stratification schemas such as CHA2DS2-VASc and HAS-BLED have modest predictive capacity for patients with AF. Machine learning (ML) techniques may improve predictive performance and support decision-making for appropriate antithrombotic therapy. We compared the performance of multilabel ML models with the currently used risk scores for predicting outcomes in AF patients. Materials and Methods: This was a retrospective cohort study of 9670 patients, mean age 76.9 years, 46% women, who were hospitalized with non-valvular AF, and had 1-year follow-up. The primary outcome was ischemic stroke and major bleeding admission. The secondary outcomes were all-cause death and event-free survival. The discriminant power of ML models was compared with clinical risk scores by the area under the curve (AUC). Risk stratification was assessed using the net reclassification index. Results: Multilabel gradient boosting machine provided the best discriminant power for stroke, major bleeding, and death (AUC = 0.685, 0.709, and 0.765 respectively) compared to other ML models. It provided modest performance improvement for stroke compared to CHA2DS2-VASc (AUC = 0.652), but significantly improved major bleeding prediction compared to HAS-BLED (AUC = 0.522). It also had a much greater discriminant power for death compared with CHA2DS2-VASc (AUC = 0.606). Also, models identified additional risk features (such as hemoglobin level, renal function, etc.) for each outcome. Conclusions: Multilabel ML models can outperform clinical risk stratification scores for predicting the risk of major bleeding and death in non-valvular AF patients.

Submitted 2 February, 2022; originally announced February 2022.

arXiv:2112.13210 [pdf, other]

doi 10.1016/j.cmpb.2021.106415

Explainable Artificial Intelligence for Pharmacovigilance: What Features Are Important When Predicting Adverse Outcomes?

Authors: Isaac Ronald Ward, Ling Wang, Juan Lu, Mohammed Bennamoun, Girish Dwivedi, Frank M Sanfilippo

Abstract: Explainable Artificial Intelligence (XAI) has been identified as a viable method for determining the importance of features when making predictions using Machine Learning (ML) models. In this study, we created models that take an individual's health information (e.g. their drug history and comorbidities) as inputs, and predict the probability that the individual will have an Acute Coronary Syndrome (ACS) adverse outcome. Using XAI, we quantified the contribution that specific drugs had on these ACS predictions, thus creating an XAI-based technique for pharmacovigilance monitoring, using ACS as an example of the adverse outcome to detect. Individuals aged over 65 who were supplied Musculo-skeletal system (anatomical therapeutic chemical (ATC) class M) or Cardiovascular system (ATC class C) drugs between 1993 and 2009 were identified, and their drug histories, comorbidities, and other key features were extracted from linked Western Australian datasets. Multiple ML models were trained to predict if these individuals would have an ACS related adverse outcome (i.e., death or hospitalisation with a discharge diagnosis of ACS), and a variety of ML and XAI techniques were used to calculate which features -- specifically which drugs -- led to these predictions. The drug dispensing features for rofecoxib and celecoxib were found to have a greater than zero contribution to ACS related adverse outcome predictions (on average), and it was found that ACS related adverse outcomes can be predicted with 72% accuracy. Furthermore, the XAI libraries LIME and SHAP were found to successfully identify both important and unimportant features, with SHAP slightly outperforming LIME. ML models trained on linked administrative health datasets in tandem with XAI algorithms can successfully quantify feature importance, and with further development, could potentially be used as pharmacovigilance monitoring techniques.

Submitted 25 December, 2021; originally announced December 2021.

Comments: Comput Methods Programs Biomed. 2021 Nov;212:106415. Epub 2021 Sep 26

arXiv:2102.10538 [pdf]

Policy-Aware Mobility Model Explains the Growth of COVID-19 in Cities

Authors: Zhenyu Han, Fengli Xu, Yong Li, Tao Jiang, Depeng Jin, Jianhua Lu, James A. Evans

Abstract: With the continued spread of coronavirus, the task of forecasting distinctive COVID-19 growth curves in different cities, which remain inadequately explained by standard epidemiological models, is critical for medical supply and treatment. Predictions must take into account non-pharmaceutical interventions to slow the spread of coronavirus, including stay-at-home orders, social distancing, quarantine and compulsory mask-wearing, leading to reductions in intra-city mobility and viral transmission. Moreover, recent work associating coronavirus with human mobility and detailed movement data suggest the need to consider urban mobility in disease forecasts. Here we show that by incorporating intra-city mobility and policy adoption into a novel metapopulation SEIR model, we can accurately predict complex COVID-19 growth patterns in U.S. cities ($R^2$ = 0.990). Estimated mobility change due to policy interventions is consistent with empirical observation from Apple Mobility Trends Reports (Pearson's R = 0.872), suggesting the utility of model-based predictions where data are limited. Our model also reproduces urban "superspreading", where a few neighborhoods account for most secondary infections across urban space, arising from uneven neighborhood populations and heightened intra-city churn in popular neighborhoods. Therefore, our model can facilitate location-aware mobility reduction policy that more effectively mitigates disease transmission at similar social cost. Finally, we demonstrate our model can serve as a fine-grained analytic and simulation framework that informs the design of rational non-pharmaceutical intervention policies.

Submitted 21 February, 2021; originally announced February 2021.

arXiv:2102.00637 [pdf]

doi 10.1200/CCI.20.00172

Computing the Hazard Ratios Associated with Explanatory Variables Using Machine Learning Models of Survival Data

Authors: Sameer Sundrani, James Lu

Abstract: Purpose: The application of Cox Proportional Hazards (CoxPH) models to survival data and the derivation of Hazard Ratio (HR) is well established. While nonlinear, tree-based Machine Learning (ML) models have been developed and applied to the survival analysis, no methodology exists for computing HRs associated with explanatory variables from such models. We describe a novel way to compute HRs from tree-based ML models using the Shapley additive explanation (SHAP) values, which is a locally accurate and consistent methodology to quantify explanatory variables' contribution to predictions. Methods: We used three sets of publicly available survival data consisting of patients with colon, breast or pan cancer and compared the performance of CoxPH to the state-of-the-art ML model, XGBoost. To compute the HR for explanatory variables from the XGBoost model, the SHAP values were exponentiated and the ratio of the means over the two subgroups calculated. The confidence interval was computed via bootstrapping the training data and generating the ML model 1000 times. Across the three data sets, we systematically compared HRs for all explanatory variables. Open-source libraries in Python and R were used in the analyses. Results: For the colon and breast cancer data sets, the performance of CoxPH and XGBoost was comparable and we showed good consistency in the computed HRs. In the pan-cancer dataset, we showed agreement in most variables but also an opposite finding in two of the explanatory variables between the CoxPH and XGBoost result. Subsequent Kaplan-Meier plots supported the finding of the XGBoost model. Conclusion: Enabling the derivation of HR from ML models can help to improve the identification of risk factors from complex survival datasets and enhance the prediction of clinical trial outcomes.
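
The HR computation described in this abstract (exponentiate the SHAP values, then take the ratio of the subgroup means) can be illustrated with a minimal sketch. It assumes the per-patient SHAP values for one explanatory variable are already available as plain numbers; the function name `hazard_ratio_from_shap` and the boolean subgroup flags are hypothetical and not taken from the paper, and the bootstrap confidence interval is not covered.

```python
import math


def hazard_ratio_from_shap(shap_values, in_subgroup):
    """Ratio of mean exp(SHAP) between two subgroups (illustrative only).

    shap_values: per-patient SHAP values for one explanatory variable.
    in_subgroup: per-patient booleans; True marks the subgroup of interest,
                 False marks the reference subgroup (hypothetical encoding).
    """
    if len(shap_values) != len(in_subgroup):
        raise ValueError("shap_values and in_subgroup must have equal length")

    # Exponentiate each SHAP value and split by subgroup.
    exposed = [math.exp(s) for s, g in zip(shap_values, in_subgroup) if g]
    reference = [math.exp(s) for s, g in zip(shap_values, in_subgroup) if not g]
    if not exposed or not reference:
        raise ValueError("both subgroups must contain at least one patient")

    # Ratio of the subgroup means of the exponentiated values.
    mean_exposed = sum(exposed) / len(exposed)
    mean_reference = sum(reference) / len(reference)
    return mean_exposed / mean_reference


if __name__ == "__main__":
    # Toy numbers, not data from the study.
    shap = [math.log(2.0), math.log(2.0), 0.0, 0.0]
    flags = [True, True, False, False]
    print(hazard_ratio_from_shap(shap, flags))
```

With these toy inputs the subgroup of interest has a mean exponentiated SHAP value of about 2 against 1 for the reference subgroup, so the sketch returns a ratio close to 2.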

Submitted 1 February, 2021; originally announced February 2021.

Comments: 27 pages, 5 figures, 1 table

MSC Class: 92-08 ACM Class: J.3

Journal ref: JCO Clinical Cancer Informatics, 2021

arXiv:2011.07511 [pdf]

Wide-field Decodable Orthogonal Fingerprints of Single Nanoparticles Unlock Multiplexed Digital Assays

Authors: Jiayan Liao, Jiajia Zhou, Yiliao Song, Baolei Liu, Yinghui Chen, Fan Wang, Chaohao Chen, Jun Lin, Xueyuan Chen, Jie Lu, Dayong Jin

Abstract: The control in optical uniformity of single nanoparticles and tuning their diversity in orthogonal dimensions, dot to dot, holds the key to unlock nanoscience and applications. Here we report that the time-domain emissive profile from a single upconversion nanoparticle, including the rising, decay and peak moment of the excited state population (T2 profile), can be arbitrarily tuned by upconversion schemes, including interfacial energy migration, concentration dependency, energy transfer, and isolation of surface quenchers. This allows us to significantly increase the coding capacity at the nanoscale. We further implement both time-resolved wide-field imaging and deep-learning techniques to decode these fingerprints, showing high accuracies at high throughput. These high-dimensional optical fingerprints provide a new horizon for applications spanning from sub-diffraction-limit data storage, security inks, to high-throughput single-molecule digital assays and super-resolution imaging.

Submitted 15 November, 2020; originally announced November 2020.

arXiv:2010.11769 [pdf]

doi 10.1038/s42256-021-00357-4

Deep learning prediction of patient response time course from early data via neural-pharmacokinetic/pharmacodynamic modeling

Authors: James Lu, Brendan Bender, Jin Y. Jin, Yuanfang Guan

Abstract: The longitudinal analysis of patient response time course following doses of therapeutics is currently performed using Pharmacokinetic/Pharmacodynamic (PK/PD) methodologies, which require significant human experience and expertise in the modeling of dynamical systems. By utilizing recent advancements in deep learning, we show that the governing differential equations can be learnt directly from longitudinal patient data. In particular, we propose a novel neural-PK/PD framework that combines key pharmacological principles with neural ordinary differential equations. We applied it to an analysis of drug concentration and platelet response from a clinical dataset consisting of over 600 patients. We show that the neural-PK/PD model improves upon a state-of-the-art model with respect to metrics for temporal prediction. Furthermore, by incorporating key PK/PD concepts into its architecture, the model can generalize and enable the simulations of patient responses to untested dosing regimens. These results demonstrate the potential of neural-PK/PD for automated predictive analytics of patient response time course.

Submitted 22 October, 2020; originally announced October 2020.

Comments: Nat Mach Intell (2021)

arXiv:1804.05351 [pdf]

doi 10.1007/s11517-018-1905-1

Intertrochanteric Fracture Visualization and Analysis Using a Map Projection Technique

Authors: Yucheng Fu, Rong Liu, Yang Liu, Jiawei Lu

Abstract: Understanding intertrochanteric fracture distribution is an important topic in orthopaedics due to its high morbidity and mortality. The intertrochanteric fracture can contain high-dimensional information including complicated 3D fracture lines, which often make it difficult to visualize or to obtain valuable statistics for clinical diagnosis and prognosis applications. This paper proposed a map projection technique to map the high-dimensional information into a 2D parametric space. This method can preserve the 3D proximal femur surface and structure while visualizing the entire fracture line with a single plot/view. Using this method and a standardization technique, a total of 100 patients with different ages and genders are studied based on the original radiographs acquired by CT scan. The comparison shows that the proposed map projection representation is more efficient and rich in information visualization than the conventional heat map technique. Using the proposed method, a fracture probability can be obtained at any location in the 2D parametric space, from which the most probable fracture region can be accurately identified. The study shows that age and gender have significant influences on intertrochanteric fracture frequency and fracture line distribution.

Submitted 13 October, 2018; v1 submitted 15 April, 2018; originally announced April 2018.

Comments: 17 pages, 10 figures, this article can be accessed via: https://rdcu.be/88ud, Med Biol Eng Comput (2018)

arXiv:1704.00793 [pdf, other]

Seeds Cleansing CNMF for Spatiotemporal Neural Signals Extraction of Miniscope Imaging Data

Authors: Jinghao Lu, Chunyuan Li, Fan Wang

Abstract: Miniscope calcium imaging is increasingly being used to monitor large populations of neuronal activities in freely behaving animals. However, due to the high background and low signal-to-noise ratio of the single-photon based imaging used in this technique, extraction of neural signals from the large numbers of imaged cells automatically has remained challenging. Here we describe a highly accurate framework for automatically identifying activated neurons and extracting calcium signals from the miniscope imaging data, seeds cleansing Constrained Nonnegative Matrix Factorization (sc-CNMF). This sc-CNMF extends the conventional CNMF with two new modules: i) a neural enhancing module to overcome miniscope-specific limitations, and ii) a seeds cleansing module combining LSTM to rigorously select and cleanse the set of seeds for detecting regions-of-interest. Our sc-CNMF yields highly stable and superior performance in analyzing miniscope calcium imaging data compared to existing methods.

Submitted 3 April, 2017; originally announced April 2017.

Comments: 14 pages, 11 figures

arXiv:1204.6376 [pdf, ps, other]

The Landscape of Complex Networks

Authors: E. Weinan, Jianfeng Lu, Yuan Yao

Abstract: Topological landscape is introduced for networks with functions defined on the nodes. By extending the notion of gradient flows to the network setting, critical nodes of different indices are defined. This leads to a concise and hierarchical representation of the network. Persistent homology from computational topology is used to design efficient algorithms for performing such analysis. Applications to some examples in social and biological networks are demonstrated, which show that critical nodes carry important information about structures and dynamics of such networks.

Submitted 28 April, 2012; originally announced April 2012.

arXiv:1102.3748 [pdf]

Temperature Dependence of Protein Folding Deduced from Quantum Transition

Authors: Liaofu Luo, Jun Lu

Abstract: A quantum theory on conformation-electron system is presented. Protein folding is regarded as the quantum transition between torsion states on polypeptide chain, and the folding rate is calculated by nonadiabatic operator method. The theory is used to study the temperature dependences of folding rate of 15 proteins and their non-Arrhenius behavior can all be deduced in a natural way. A general formula on the rate-temperature dependence has been deduced which is in good accordance with experimental data. These temperature dependences are further analyzed in terms of torsion potential parameters. Our results show it is necessary to move outside the realm of classical physics when the temperature dependence of protein folding is studied quantitatively.

Submitted 17 February, 2011; originally announced February 2011.

arXiv:1004.3843 [pdf]

Information-theoretic View of Sequence Organization in a Genome

Authors: Liaofu Luo, Yang Gao, Jun Lu

Abstract: Sequence organizations are viewed from two points: one is from informational redundancy or informational correlation (IC) and another is from k-mer frequency statistics. Two problems are investigated. The first is how the ICs exceed the fluctuation bound and the order emerges from fluctuation in a genome when the sequence length attains some critical value. We demonstrated that the transition from fluctuation to order takes place at about sequence length 200-300 thousand bases for human and E. coli genome. It means that the life emerges from a region between macroscopic and microscopic. The second is about the statistical law of the k-mer organization in a genome under the evolutionary pressure and functional selection. We deduced a sum rule Q(k,N) on the k-mer frequency deviations from the randomness in a N-long sequence of genome and deduced the relations of Q(k,N) with k and N. We found that Q(k,N) increases with length N at a constant rate for most genome sequences and demonstrated that when the functional selection of k-mers is accumulated to some critical value the ordering takes place. An important finding is the sum rule correlated with the evolutionary complexity of the genome.

Submitted 22 April, 2010; originally announced April 2010.

Comments: 19 pages, 4 figures

arXiv:0710.2510 [pdf, other]

doi 10.1529/biophysj.107.123851

Intracellular microrheology of motile Amoeba proteus

Authors: Salman S. Rogers, Thomas A. Waigh, Jian R. Lu

Abstract: The motility of motile Amoeba proteus was examined using the technique of passive particle tracking microrheology, with the aid of newly-developed particle tracking software, a fast digital camera and an optical microscope. We tracked large numbers of endogenous particles in the amoebae, which displayed subdiffusive motion at short time scales, corresponding to thermal motion in a viscoelastic medium, and superdiffusive motion at long time scales due to the convection of the cytoplasm. Subdiffusive motion was characterised by a rheological scaling exponent of 3/4 in the cortex, indicative of the semiflexible dynamics of the actin fibres. We observed shear-thinning in the flowing endoplasm, where exponents increased with increasing flow rate; i.e. the endoplasm became more fluid-like. The rheology of the cortex is found to be isotropic, reflecting an isotropic actin gel. A clear difference was seen between cortical and endoplasmic layers in terms of both viscoelasticity and flow velocity, where the profile of the latter is close to a Poiseuille flow for a Newtonian fluid.

Submitted 12 October, 2007; originally announced October 2007.

arXiv:0706.0194 [pdf]

Comparing Classical Pathways and Modern Networks: Towards the Development of an Edge Ontology

Authors: Long J. Lu, Andrea Sboner, Yuanpeng J. Huang, Hao Xin Lu, Tara A. Gianoulis, Kevin Y. Yip, Philip M. Kim, Gaetano T. Montelione, Mark B. Gerstein

Abstract: Pathways are integral to systems biology. Their classical representation has proven useful but is inconsistent in the meaning assigned to each arrow (or edge) and inadvertently implies the isolation of one pathway from another. Conversely, modern high-throughput experiments give rise to standardized networks facilitating topological calculations. Combining these perspectives, we can embed classical pathways within large-scale networks and thus demonstrate the crosstalk between them. As more diverse types of high-throughput data become available, we can effectively merge both perspectives, embedding pathways simultaneously in multiple networks. However, the original problem still remains - the current edge representation is inadequate to accurately convey all the information in pathways. Therefore, we suggest that a standardized, well-defined, edge ontology is necessary and propose a prototype here, as a starting point for reaching this goal.

Submitted 1 June, 2007; originally announced June 2007.

Comments: 30 pages including 5 figures and supplemental material

Showing 1–30 of 30 results for author: Lu, J