Search | arXiv e-print repository

BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang

Abstract: Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by ins… ▽ More Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by instruction fine-tuning on a combination of labeled datasets and synthetic pairs. Experiments on 5 biomedical tasks across 11 datasets verify BMRetriever's efficacy on various biomedical applications. BMRetriever also exhibits strong parameter efficiency, with the 410M variant outperforming baselines up to 11.7 times larger, and the 2B variant matching the performance of models with over 5B parameters. The training data and model checkpoints are released at \url{https://huggingface.co/BMRetriever} to ensure transparency, reproducibility, and application to new domains. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: Work in progress. The model and data will be uploaded to \url{https://github.com/ritaranx/BMRetriever}

arXiv:2403.14202 [pdf, other]

Two fitness inference schemes compared using allele frequencies from 1,068,391 sequences sampled in the UK during the COVID-19 pandemic

Authors: Hong-Li Zeng, Cheng-Long Yang, Bo Jing, John Barton, Erik Aurell

Abstract: Throughout the course of the SARS-CoV-2 pandemic, genetic variation has contributed to the spread and persistence of the virus. For example, various mutations have allowed SARS-CoV-2 to escape antibody neutralization or to bind more strongly to the receptors that it uses to enter human cells. Here, we compared two methods that estimate the fitness effects of viral mutations using the abundant sequ… ▽ More Throughout the course of the SARS-CoV-2 pandemic, genetic variation has contributed to the spread and persistence of the virus. For example, various mutations have allowed SARS-CoV-2 to escape antibody neutralization or to bind more strongly to the receptors that it uses to enter human cells. Here, we compared two methods that estimate the fitness effects of viral mutations using the abundant sequence data gathered over the course of the pandemic. Both approaches are grounded in population genetics theory but with different assumptions. One approach, tQLE, features an epistatic fitness landscape and assumes that alleles are nearly in linkage equilibrium. Another approach, MPL, assumes a simple, additive fitness landscape, but allows for any level of correlation between alleles. We characterized differences in the distributions of fitness values inferred by each approach and in the ranks of fitness values that they assign to sequences across time. We find that in a large fraction of weeks the two methods are in good agreement as to their top-ranked sequences, i.e., as to which sequences observed that week are most fit. We also find that agreement between ranking of sequences varies with genetic unimodality in the population in a given week. △ Less

Submitted 21 March, 2024; originally announced March 2024.

Comments: 10 pages, 6 figures

arXiv:2403.00815 [pdf, other]

RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May D. Wang, Joyce C. Ho, Carl Yang

Abstract: We present RAM-EHR, a Retrieval AugMentation pipeline to improve clinical predictions on Electronic Health Records (EHRs). RAM-EHR first collects multiple knowledge sources, converts them into text format, and uses dense retrieval to obtain information related to medical concepts. This strategy addresses the difficulties associated with complex names for the concepts. RAM-EHR then augments the loc… ▽ More We present RAM-EHR, a Retrieval AugMentation pipeline to improve clinical predictions on Electronic Health Records (EHRs). RAM-EHR first collects multiple knowledge sources, converts them into text format, and uses dense retrieval to obtain information related to medical concepts. This strategy addresses the difficulties associated with complex names for the concepts. RAM-EHR then augments the local EHR predictive model co-trained with consistency regularization to capture complementary information from patient visits and summarized knowledge. Experiments on two EHR datasets show the efficacy of RAM-EHR over previous knowledge-enhanced baselines (3.4% gain in AUROC and 7.2% gain in AUPR), emphasizing the effectiveness of the summarized knowledge from RAM-EHR for clinical prediction tasks. The code will be published at \url{https://github.com/ritaranx/RAM-EHR}. △ Less

Submitted 4 June, 2024; v1 submitted 25 February, 2024; originally announced March 2024.

Comments: ACL 2024

Journal ref: ACL 2024

arXiv:2403.00005 [pdf]

doi 10.1038/s41467-022-28483-6

Organic electrochemical neurons and synapses with ion mediated spiking

Authors: H. Padinhare, C. Yang, D. Tu, J. Gerasimov, A. M. M. Dar, A. A. Moreira, M. Massetti, R. Kroon, D. Bliman, R. Olsson, E. Stavrinidou, M. Berggren, S. Fabiano

Abstract: Future brain-machine interfaces, prosthetics, and intelligent soft robotics will require integrating artificial neuromorphic devices with biological systems. Due to their poor biocompatibility, circuit complexity, low energy efficiency, and operating principles fundamentally different from the ion signal modulation of biology, traditional Silicon-based neuromorphic implementations have limited bio… ▽ More Future brain-machine interfaces, prosthetics, and intelligent soft robotics will require integrating artificial neuromorphic devices with biological systems. Due to their poor biocompatibility, circuit complexity, low energy efficiency, and operating principles fundamentally different from the ion signal modulation of biology, traditional Silicon-based neuromorphic implementations have limited bio-integration potential. Here, we report the first organic electrochemical neurons (OECNs) with ion-modulated spiking, based on allprinted complementary organic electrochemical transistors. We demonstrate facile biointegration of OECNs with Venus Flytrap (Dionaea muscipula) to induce lobe closure upon input stimuli. The OECNs can also be integrated with all-printed organic electrochemical synapses (OECSs), exhibiting short-term plasticity with paired-pulse facilitation and longterm plasticity with retention >1000 s, facilitating Hebbian learning. These soft and flexible OECNs operate below 0.6 V and respond to multiple stimuli, defining a new vista for localized artificial neuronal systems possible to integrate with bio-signaling systems of plants, invertebrates, and vertebrates. △ Less

Submitted 18 January, 2024; originally announced March 2024.

arXiv:2311.08611 [pdf, ps, other]

Theory of Infectious Diseases with Testing and Testing-less Covid-19 Endemic

Authors: Bo Deng, Chayu Yang

Abstract: What is the long term dynamics of the Covid-19 pandemic? How will it end? Here we constructed an infectious disease model with testing and analyzed the existence and stability of its endemic states. For a large parameter set, including those relevant to the SARS-CoV-2 virus, we demonstrated the existence of one endemic equilibrium without testing and one endemic equilibrium with testing and proved… ▽ More What is the long term dynamics of the Covid-19 pandemic? How will it end? Here we constructed an infectious disease model with testing and analyzed the existence and stability of its endemic states. For a large parameter set, including those relevant to the SARS-CoV-2 virus, we demonstrated the existence of one endemic equilibrium without testing and one endemic equilibrium with testing and proved their local and global stabilities for some cases. Our results suggest that the pandemic is to end with a testing-less endemic state through a novel and surprising mechanism called stochastic trapping. △ Less

Submitted 14 November, 2023; originally announced November 2023.

arXiv:2311.00287 [pdf, other]

Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

Authors: Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, Wei Jin, Joyce Ho, Carl Yang

Abstract: Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts. Recently, large language models (LLMs) have shown promise in this domain. Yet, their direct deployment can lead to privacy issues and are constrained by resources. To address this challenge, we delve into synthetic clinical text generation us… ▽ More Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts. Recently, large language models (LLMs) have shown promise in this domain. Yet, their direct deployment can lead to privacy issues and are constrained by resources. To address this challenge, we delve into synthetic clinical text generation using LLMs for clinical NLP tasks. We propose an innovative, resource-efficient approach, ClinGen, which infuses knowledge into the process. Our model involves clinical knowledge extraction and context-informed LLM prompting. Both clinical topics and writing styles are drawn from external domain-specific knowledge graphs and LLMs to guide data generation. Our extensive empirical study across 7 clinical NLP tasks and 16 datasets reveals that ClinGen consistently enhances performance across various tasks, effectively aligning the distribution of real datasets and significantly enriching the diversity of generated training instances. We will publish our code and all the generated data in \url{https://github.com/ritaranx/ClinGen}. △ Less

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2309.01941 [pdf, other]

Dynamic Brain Transformer with Multi-level Attention for Functional Brain Network Analysis

Authors: Xuan Kan, Antonio Aodong Chen Gu, Hejie Cui, Ying Guo, Carl Yang

Abstract: Recent neuroimaging studies have highlighted the importance of network-centric brain analysis, particularly with functional magnetic resonance imaging. The emergence of Deep Neural Networks has fostered a substantial interest in predicting clinical outcomes and categorizing individuals based on brain networks. However, the conventional approach involving static brain network analysis offers limite… ▽ More Recent neuroimaging studies have highlighted the importance of network-centric brain analysis, particularly with functional magnetic resonance imaging. The emergence of Deep Neural Networks has fostered a substantial interest in predicting clinical outcomes and categorizing individuals based on brain networks. However, the conventional approach involving static brain network analysis offers limited potential in capturing the dynamism of brain function. Although recent studies have attempted to harness dynamic brain networks, their high dimensionality and complexity present substantial challenges. This paper proposes a novel methodology, Dynamic bRAin Transformer (DART), which combines static and dynamic brain networks for more effective and nuanced brain function analysis. Our model uses the static brain network as a baseline, integrating dynamic brain networks to enhance performance against traditional methods. We innovatively employ attention mechanisms, enhancing model explainability and exploiting the dynamic brain network's temporal variations. The proposed approach offers a robust solution to the low signal-to-noise ratio of blood-oxygen-level-dependent signals, a recurring issue in direct DNN modeling. It also provides valuable insights into which brain circuits or dynamic networks contribute more to final predictions. As such, DRAT shows a promising direction in neuroimaging studies, contributing to the comprehensive understanding of brain organization and the role of neural circuits. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: Accepted to IEEE BHI 2023

MSC Class: 68T07; 68T05 ACM Class: I.2.6; J.3

arXiv:2306.11976 [pdf, other]

Interactive Molecular Discovery with Natural Language

Authors: Zheni Zeng, Bangchen Yin, Shipeng Wang, Jiarui Liu, Cheng Yang, Haishen Yao, Xingzhi Sun, Maosong Sun, Guotong Xie, Zhiyuan Liu

Abstract: Natural language is expected to be a key medium for various human-machine interactions in the era of large language models. When it comes to the biochemistry field, a series of tasks around molecules (e.g., property prediction, molecule mining, etc.) are of great significance while having a high technical threshold. Bridging the molecule expressions in natural language and chemical language can no… ▽ More Natural language is expected to be a key medium for various human-machine interactions in the era of large language models. When it comes to the biochemistry field, a series of tasks around molecules (e.g., property prediction, molecule mining, etc.) are of great significance while having a high technical threshold. Bridging the molecule expressions in natural language and chemical language can not only hugely improve the interpretability and reduce the operation difficulty of these tasks, but also fuse the chemical knowledge scattered in complementary materials for a deeper comprehension of molecules. Based on these benefits, we propose the conversational molecular design, a novel task adopting natural language for describing and editing target molecules. To better accomplish this task, we design ChatMol, a knowledgeable and versatile generative pre-trained model, enhanced by injecting experimental property information, molecular spatial knowledge, and the associations between natural and chemical languages into it. Several typical solutions including large language models (e.g., ChatGPT) are evaluated, proving the challenge of conversational molecular design and the effectiveness of our knowledge enhancement method. Case observations and analysis are conducted to provide directions for further exploration of natural-language interaction in molecular discovery. △ Less

Submitted 20 June, 2023; originally announced June 2023.

arXiv:2306.02532 [pdf, other]

doi 10.1145/3580305.3599483

R-Mixup: Riemannian Mixup for Biological Networks

Authors: Xuan Kan, Zimu Li, Hejie Cui, Yue Yu, Ran Xu, Shaojun Yu, Zilong Zhang, Ying Guo, Carl Yang

Abstract: Biological networks are commonly used in biomedical and healthcare domains to effectively model the structure of complex biological systems with interactions linking biological entities. However, due to their characteristics of high dimensionality and low sample size, directly applying deep learning models on biological networks usually faces severe overfitting. In this work, we propose R-MIXUP, a… ▽ More Biological networks are commonly used in biomedical and healthcare domains to effectively model the structure of complex biological systems with interactions linking biological entities. However, due to their characteristics of high dimensionality and low sample size, directly applying deep learning models on biological networks usually faces severe overfitting. In this work, we propose R-MIXUP, a Mixup-based data augmentation technique that suits the symmetric positive definite (SPD) property of adjacency matrices from biological networks with optimized training efficiency. The interpolation process in R-MIXUP leverages the log-Euclidean distance metrics from the Riemannian manifold, effectively addressing the swelling effect and arbitrarily incorrect label issues of vanilla Mixup. We demonstrate the effectiveness of R-MIXUP with five real-world biological network datasets on both regression and classification tasks. Besides, we derive a commonly ignored necessary condition for identifying the SPD matrices of biological networks and empirically study its influence on the model performance. The code implementation can be found in Appendix E. △ Less

Submitted 4 June, 2023; originally announced June 2023.

Comments: Accepted to KDD 2023

MSC Class: 68T07; 68T05 ACM Class: I.2.6; J.3

arXiv:2305.14376 [pdf, other]

PTGB: Pre-Train Graph Neural Networks for Brain Network Analysis

Authors: Yi Yang, Hejie Cui, Carl Yang

Abstract: The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways. Recent advances in neuroscience and neuroimaging analysis have shown a growing interest in the interactions between brain regions of interest (ROIs) and their impact on neural development and disorder diagnosis. As a powerful deep model for analyzing graph-structured data, Graph Ne… ▽ More The human brain is the central hub of the neurobiological system, controlling behavior and cognition in complex ways. Recent advances in neuroscience and neuroimaging analysis have shown a growing interest in the interactions between brain regions of interest (ROIs) and their impact on neural development and disorder diagnosis. As a powerful deep model for analyzing graph-structured data, Graph Neural Networks (GNNs) have been applied for brain network analysis. However, training deep models requires large amounts of labeled data, which is often scarce in brain network datasets due to the complexities of data acquisition and sharing restrictions. To make the most out of available training data, we propose PTGB, a GNN pre-training framework that captures intrinsic brain network structures, regardless of clinical outcomes, and is easily adaptable to various downstream tasks. PTGB comprises two key components: (1) an unsupervised pre-training technique designed specifically for brain networks, which enables learning from large-scale datasets without task-specific labels; (2) a data-driven parcellation atlas mapping pipeline that facilitates knowledge transfer across datasets with different ROI systems. Extensive evaluations using various GNN models have demonstrated the robust and superior performance of PTGB compared to baseline methods. △ Less

Submitted 20 May, 2023; originally announced May 2023.

Comments: Accepted to CHIL 2023, 19 pages

arXiv:2305.04142 [pdf, other]

Transformer-Based Hierarchical Clustering for Brain Network Analysis

Authors: Wei Dai, Hejie Cui, Xuan Kan, Ying Guo, Sanne van Rooij, Carl Yang

Abstract: Brain networks, graphical models such as those constructed from MRI, have been widely used in pathological prediction and analysis of brain functions. Within the complex brain system, differences in neuronal connection strengths parcellate the brain into various functional modules (network communities), which are critical for brain analysis. However, identifying such communities within the brain h… ▽ More Brain networks, graphical models such as those constructed from MRI, have been widely used in pathological prediction and analysis of brain functions. Within the complex brain system, differences in neuronal connection strengths parcellate the brain into various functional modules (network communities), which are critical for brain analysis. However, identifying such communities within the brain has been a nontrivial issue due to the complexity of neuronal interactions. In this work, we propose a novel interpretable transformer-based model for joint hierarchical cluster identification and brain network classification. Extensive experimental results on real-world brain network datasets show that with the help of hierarchical clustering, the model achieves increased accuracy and reduced runtime complexity while providing plausible insight into the functional organization of brain regions. The implementation is available at https://github.com/DDVD233/THC. △ Less

Submitted 6 May, 2023; originally announced May 2023.

Comments: Accepted to IEEE-ISBI 2023

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2301.03659 [pdf]

Multifunctional fiber-based optoacoustic emitter for non-genetic bidirectional neural communication

Authors: Nan Zheng, Ying Jiang, Shan Jiang, Jongwoon Kim, Yueming Li, Ji-Xin Cheng, Xiaoting Jia, Chen Yang

Abstract: A bidirectional brain interface with both "write" and "read" functions can be an important tool for fundamental studies and potential clinical treatments for neurological diseases. Here we report a miniaturized multifunctional fiber based optoacoustic emitter (mFOE) that first integrates simultaneous non-genetic optoacoustic stimulation for "write" and electrophysiology recording of neural circuit… ▽ More A bidirectional brain interface with both "write" and "read" functions can be an important tool for fundamental studies and potential clinical treatments for neurological diseases. Here we report a miniaturized multifunctional fiber based optoacoustic emitter (mFOE) that first integrates simultaneous non-genetic optoacoustic stimulation for "write" and electrophysiology recording of neural circuits for "read". The non-genetic feature addresses the challenges of the viral transfection required by optogenetics in primates and human. The orthogonality between optoacoustic waves and electrical field provides a solution to avoid the interference between electrical stimulation and recording. We first validated the non-genetic stimulation function of the mFOE in rat cultured neurons using calcium imaging. In vivo application of mFOE for successful simultaneous optoacoustic stimulation and electrical recording of brain activities was confirmed in mouse hippocampus in both acute and chronical applications up to 1 month. Minimal brain tissue damage has been confirmed after these applications. The capability of non-genetic neural stimulation and recording enabled by mFOE opens up new possibilities for the investigation of neural circuits and brings new insights into the study of ultrasound neurostimulation. △ Less

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2212.00735 [pdf, other]

xTrimoABFold: De novo Antibody Structure Prediction without MSA

Authors: Yining Wang, Xumeng Gong, Shaochuan Li, Bing Yang, YiWu Sun, Chuan Shi, Yangang Wang, Cheng Yang, Hui Li, Le Song

Abstract: In the field of antibody engineering, an essential task is to design a novel antibody whose paratopes bind to a specific antigen with correct epitopes. Understanding antibody structure and its paratope can facilitate a mechanistic understanding of its function. Therefore, antibody structure prediction from its sequence alone has always been a highly valuable problem for de novo antibody design. Al… ▽ More In the field of antibody engineering, an essential task is to design a novel antibody whose paratopes bind to a specific antigen with correct epitopes. Understanding antibody structure and its paratope can facilitate a mechanistic understanding of its function. Therefore, antibody structure prediction from its sequence alone has always been a highly valuable problem for de novo antibody design. AlphaFold2, a breakthrough in the field of structural biology, provides a solution to predict protein structure based on protein sequences and computationally expensive coevolutionary multiple sequence alignments (MSAs). However, the computational efficiency and undesirable prediction accuracy of antibodies, especially on the complementarity-determining regions (CDRs) of antibodies limit their applications in the industrially high-throughput drug design. To learn an informative representation of antibodies, we employed a deep antibody language model (ALM) on curated sequences from the observed antibody space database via a transformer model. We also developed a novel model named xTrimoABFold to predict antibody structure from antibody sequence based on the pretrained ALM as well as efficient evoformers and structural modules. The model was trained end-to-end on the antibody structures in PDB by minimizing the ensemble loss of domain-specific focal loss on CDR and the frame-aligned point loss. xTrimoABFold outperforms AlphaFold2 and other protein language model based SOTAs, e.g., OmegaFold, HelixFold-Single, and IgFold with a large significant margin (30+\% improvement on RMSD) while performing 151 times faster than AlphaFold2. To the best of our knowledge, xTrimoABFold achieved state-of-the-art antibody structure prediction. Its improvement in both accuracy and efficiency makes it a valuable tool for de novo antibody design and could make further improvements in immuno-theory. △ Less

Submitted 4 May, 2023; v1 submitted 30 November, 2022; originally announced December 2022.

Comments: 14 pages, 5 figures

arXiv:2211.16214 [pdf]

A biologically interfaced evolvable organic pattern classifier

Authors: Jennifer Gerasimov, Deyu Tu, Vivek Hitaishi, Padinhare Cholakkal Harikesh, Chi-Yuan Yang, Tobias Abrahamsson, Meysam Rad, Mary J. Donahue, Malin Silverå Ejneby, Magnus Berggren, Robert Forchheimer, Simone Fabiano

Abstract: Future brain-computer interfaces will require local and highly individualized signal processing of fully integrated electronic circuits within the nervous system and other living tissue. New devices will need to be developed that can receive data from a sensor array, process data into meaningful information, and translate that information into a format that living systems can interpret. Here, we r… ▽ More Future brain-computer interfaces will require local and highly individualized signal processing of fully integrated electronic circuits within the nervous system and other living tissue. New devices will need to be developed that can receive data from a sensor array, process data into meaningful information, and translate that information into a format that living systems can interpret. Here, we report the first example of interfacing a hardware-based pattern classifier with a biological nerve. The classifier implements the Widrow-Hoff learning algorithm on an array of evolvable organic electrochemical transistors (EOECTs). The EOECTs' channel conductance is modulated in situ by electropolymerizing the semiconductor material within the channel, allowing for low voltage operation, high reproducibility, and an improvement in state retention of two orders of magnitude over state-of-the-art OECT devices. The organic classifier is interfaced with a biological nerve using an organic electrochemical spiking neuron to translate the classifier's output to a simulated action potential. The latter is then used to stimulate muscle contraction selectively based on the input pattern, thus paving the way for the development of closed-loop therapeutic systems. △ Less

Submitted 29 November, 2022; originally announced November 2022.

arXiv:2211.00261 [pdf, other]

Learning Task-Aware Effective Brain Connectivity for fMRI Analysis with Graph Neural Networks

Authors: Yue Yu, Xuan Kan, Hejie Cui, Ran Xu, Yujia Zheng, Xiangchen Song, Yanqiao Zhu, Kun Zhang, Razieh Nabi, Ying Guo, Chao Zhang, Carl Yang

Abstract: Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downs… ▽ More Functional magnetic resonance imaging (fMRI) has become one of the most common imaging modalities for brain function analysis. Recently, graph neural networks (GNN) have been adopted for fMRI analysis with superior performance. Unfortunately, traditional functional brain networks are mainly constructed based on similarities among region of interests (ROI), which are noisy and agnostic to the downstream prediction tasks and can lead to inferior results for GNN-based models. To better adapt GNNs for fMRI analysis, we propose TBDS, an end-to-end framework based on \underline{T}ask-aware \underline{B}rain connectivity \underline{D}AG (short for Directed Acyclic Graph) \underline{S}tructure generation for fMRI analysis. The key component of TBDS is the brain network generator which adopts a DAG learning approach to transform the raw time-series into task-aware brain connectivities. Besides, we design an additional contrastive regularization to inject task-specific knowledge during the brain network generation process. Comprehensive experiments on two fMRI datasets, namely Adolescent Brain Cognitive Development (ABCD) and Philadelphia Neuroimaging Cohort (PNC) datasets demonstrate the efficacy of TBDS. In addition, the generated brain networks also highlight the prediction-related brain regions and thus provide unique interpretations of the prediction results. Our implementation will be published to https://github.com/yueyu1030/TBDS upon acceptance. △ Less

Submitted 31 October, 2022; originally announced November 2022.

Comments: Work in progress

arXiv:2210.10871 [pdf]

Stable ion-tunable antiambipolarity in mixed ion-electron conducting polymers enables biorealistic artificial neurons

Authors: Padinhare Cholakkal Harikesh, Chi-Yuan Yang, Han-Yan Wu, Silan Zhang, Jun-Da Huang, Magnus Berggren, Deyu Tu, Simone Fabiano

Abstract: Bio-integrated neuromorphic systems promise for new protocols to record and regulate the signaling of biological systems. Making such artificial neural circuits successful requires minimal circuit complexity and ion-based operating mechanisms similar to that of biology. However, simple leaky integrate-and-fire model neurons, commonly realized in either silicon or organic semiconductor neuromorphic… ▽ More Bio-integrated neuromorphic systems promise for new protocols to record and regulate the signaling of biological systems. Making such artificial neural circuits successful requires minimal circuit complexity and ion-based operating mechanisms similar to that of biology. However, simple leaky integrate-and-fire model neurons, commonly realized in either silicon or organic semiconductor neuromorphic systems, can emulate only a few neural features. More functional neuron models, based on traditional complex Si-based complementary-metal-oxide-semiconductor (CMOS) or negative differential resistance (NDR) device circuits, are complicated to fabricate, not biocompatible, and lack ion- and chemical-based modulation features. Here we report a biorealistic conductance-based organic electrochemical neuron (c-OECN) using a mixed ion-electron conducting ladder-type polymer with reliable ion-tunable antiambipolarity. The latter is used to emulate the activation/inactivation of Na channels and delayed activation of K channels of biological neurons. These c-OECNs can then spike at bioplausible frequencies nearing 100 Hz, emulate most critical biological neural features, demonstrate stochastic spiking, and enable neurotransmitter and Ca2+-based spiking modulation. These combined features are impossible to achieve using previous technologies. △ Less

Submitted 19 October, 2022; originally announced October 2022.

arXiv:2208.06487 [pdf, other]

Scaling and the Universality of Function Diversity Across Human Organizations

Authors: Vicky Chuqiao Yang, Christopher P. Kempes, Hyejin Youn, Sidney Redner, Geoffrey B. West

Abstract: Function diversity, namely, the range of tasks individuals can perform, is essential to productive organizations. This concept has been studied in disparate discipline contexts, while general patterns and mechanisms remain unclear. Here, we first analyze over five thousand observations of top-down organizations -- US federal agencies, Norwegian companies, and US universities, and find that the num… ▽ More Function diversity, namely, the range of tasks individuals can perform, is essential to productive organizations. This concept has been studied in disparate discipline contexts, while general patterns and mechanisms remain unclear. Here, we first analyze over five thousand observations of top-down organizations -- US federal agencies, Norwegian companies, and US universities, and find that the number of distinct functions scales with organizational size, approximately as a power law with an exponent of 1/2. Further, we find common patterns in the distribution of function abundance within organizations. This universality suggests that human organizations, despite differences in their purpose, structure, and culture, may share common mechanisms for creating specializations. Additionally, we find that cities -- bottom-up organizations -- differ from top-down organizations and exhibit logarithmic scaling. We discuss potential avenues for modeling the mechanisms for these observations using history-dependent random processes, and offer several criteria for model selection. △ Less

Submitted 12 August, 2022; originally announced August 2022.

Comments: 13 pages, 3 figures

arXiv:2207.11547 [pdf]

A Ligand-and-structure Dual-driven Deep Learning Method for the Discovery of Highly Potent GnRH1R Antagonist to treat Uterine Diseases

Authors: Song Li, Song Ke, Chenxing Yang, Jun Chen, Yi Xiong, Lirong Zheng, Hao Liu, Liang Hong

Abstract: Gonadotrophin-releasing hormone receptor (GnRH1R) is a promising therapeutic target for the treatment of uterine diseases. To date, several GnRH1R antagonists are available in clinical investigation without satisfying multiple property constraints. To fill this gap, we aim to develop a deep learning-based framework to facilitate the effective and efficient discovery of a new orally active small-mo… ▽ More Gonadotrophin-releasing hormone receptor (GnRH1R) is a promising therapeutic target for the treatment of uterine diseases. To date, several GnRH1R antagonists are available in clinical investigation without satisfying multiple property constraints. To fill this gap, we aim to develop a deep learning-based framework to facilitate the effective and efficient discovery of a new orally active small-molecule drug targeting GnRH1R with desirable properties. In the present work, a ligand-and-structure combined model, namely LS-MolGen, was firstly proposed for molecular generation by fully utilizing the information on the known active compounds and the structure of the target protein, which was demonstrated by its superior performance than ligand- or structure-based methods separately. Then, a in silico screening including activity prediction, ADMET evaluation, molecular docking and FEP calculation was conducted, where ~30,000 generated novel molecules were narrowed down to 8 for experimental synthesis and validation. In vitro and in vivo experiments showed that three of them exhibited potent inhibition activities (compound 5 IC50 = 0.856 nM, compound 6 IC50 = 0.901 nM, compound 7 IC50 = 2.54 nM) against GnRH1R, and compound 5 performed well in fundamental PK properties, such as half-life, oral bioavailability, and PPB, etc. We believed that the proposed ligand-and-structure combined molecular generative model and the whole computer-aided workflow can potentially be extended to similar tasks for de novo drug design or lead optimization. △ Less

Submitted 23 July, 2022; originally announced July 2022.

arXiv:2207.00813 [pdf, other]

Interpretable Graph Neural Networks for Connectome-Based Brain Disorder Analysis

Authors: Hejie Cui, Wei Dai, Yanqiao Zhu, Xiaoxiao Li, Lifang He, Carl Yang

Abstract: Human brains lie at the core of complex neurobiological systems, where the neurons, circuits, and subsystems interact in enigmatic ways. Understanding the structural and functional mechanisms of the brain has long been an intriguing pursuit for neuroscience research and clinical disorder therapy. Mapping the connections of the human brain as a network is one of the most pervasive paradigms in neur… ▽ More Human brains lie at the core of complex neurobiological systems, where the neurons, circuits, and subsystems interact in enigmatic ways. Understanding the structural and functional mechanisms of the brain has long been an intriguing pursuit for neuroscience research and clinical disorder therapy. Mapping the connections of the human brain as a network is one of the most pervasive paradigms in neuroscience. Graph Neural Networks (GNNs) have recently emerged as a potential method for modeling complex network data. Deep models, on the other hand, have low interpretability, which prevents their usage in decision-critical contexts like healthcare. To bridge this gap, we propose an interpretable framework to analyze disorder-specific Regions of Interest (ROIs) and prominent connections. The proposed framework consists of two modules: a brain-network-oriented backbone model for disease prediction and a globally shared explanation generator that highlights disorder-specific biomarkers including salient ROIs and important connections. We conduct experiments on three real-world datasets of brain disorders. The results verify that our framework can obtain outstanding performance and also identify meaningful biomarkers. All code for this work is available at https://github.com/HennyJie/IBGNN.git. △ Less

Submitted 23 July, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

Comments: Previous version presented at icml-imlh 2021 (no proceedings, archived at 2107.05097), this version is accepted to miccai 2022

MSC Class: 68T07; 92C50; ACM Class: I.2.0; I.2.6; J.3

arXiv:2206.04486 [pdf, other]

doi 10.1145/3534678.3542680

Data-Efficient Brain Connectome Analysis via Multi-Task Meta-Learning

Authors: Yi Yang, Yanqiao Zhu, Hejie Cui, Xuan Kan, Lifang He, Ying Guo, Carl Yang

Abstract: Brain networks characterize complex connectivities among brain regions as graph structures, which provide a powerful means to study brain connectomes. In recent years, graph neural networks have emerged as a prevalent paradigm of learning with structured data. However, most brain network datasets are limited in sample sizes due to the relatively high cost of data acquisition, which hinders the dee… ▽ More Brain networks characterize complex connectivities among brain regions as graph structures, which provide a powerful means to study brain connectomes. In recent years, graph neural networks have emerged as a prevalent paradigm of learning with structured data. However, most brain network datasets are limited in sample sizes due to the relatively high cost of data acquisition, which hinders the deep learning models from sufficient training. Inspired by meta-learning that learns new concepts fast with limited training examples, this paper studies data-efficient training strategies for analyzing brain connectomes in a cross-dataset setting. Specifically, we propose to meta-train the model on datasets of large sample sizes and transfer the knowledge to small datasets. In addition, we also explore two brain-network-oriented designs, including atlas transformation and adaptive task reweighing. Compared to other pre-training strategies, our meta-learning-based approach achieves higher and stabler performance, which demonstrates the effectiveness of our proposed solutions. The framework is also able to derive new insights regarding the similarities among datasets and diseases in a data-driven fashion. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: Accepted to KDD 2022 (Health Day), 9 pages

arXiv:2205.12465 [pdf, other]

FBNETGEN: Task-aware GNN-based fMRI Analysis via Functional Brain Network Generation

Authors: Xuan Kan, Hejie Cui, Joshua Lukemire, Ying Guo, Carl Yang

Abstract: Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also incompatible with the dee… ▽ More Functional magnetic resonance imaging (fMRI) is one of the most common imaging modalities to investigate brain functions. Recent studies in neuroscience stress the great potential of functional brain networks constructed from fMRI data for clinical predictions. Traditional functional brain networks, however, are noisy and unaware of downstream prediction tasks, while also incompatible with the deep graph neural network (GNN) models. In order to fully unleash the power of GNNs in network-based fMRI analysis, we develop FBNETGEN, a task-aware and interpretable fMRI analysis framework via deep brain network generation. In particular, we formulate (1) prominent region of interest (ROI) features extraction, (2) brain networks generation, and (3) clinical predictions with GNNs, in an end-to-end trainable model under the guidance of particular prediction tasks. Along with the process, the key novel component is the graph generator which learns to transform raw time-series features into task-oriented brain networks. Our learnable graphs also provide unique interpretations by highlighting prediction-related brain regions. Comprehensive experiments on two datasets, i.e., the recently released and currently largest publicly available fMRI dataset Adolescent Brain Cognitive Development (ABCD), and the widely-used fMRI dataset PNC, prove the superior effectiveness and interpretability of FBNETGEN. The implementation is available at https://github.com/Wayfear/FBNETGEN. △ Less

Submitted 29 May, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: This paper has been accepted for presentation in MIDL 2022

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2205.11914 [pdf, other]

An Adaptive Contrastive Learning Model for Spike Sorting

Authors: Lang Qian, Shengjie Zheng, Chunshan Deng, Cheng Yang, Xiaojian Li

Abstract: Brain-computer interfaces (BCIs), is ways for electronic devices to communicate directly with the brain. For most medical-type brain-computer interface tasks, the activity of multiple units of neurons or local field potentials is sufficient for decoding. But for BCIs used in neuroscience research, it is important to separate out the activity of individual neurons. With the development of large-sca… ▽ More Brain-computer interfaces (BCIs), is ways for electronic devices to communicate directly with the brain. For most medical-type brain-computer interface tasks, the activity of multiple units of neurons or local field potentials is sufficient for decoding. But for BCIs used in neuroscience research, it is important to separate out the activity of individual neurons. With the development of large-scale silicon technology and the increasing number of probe channels, artificially interpreting and labeling spikes is becoming increasingly impractical. In this paper, we propose a novel modeling framework: Adaptive Contrastive Learning Model that learns representations from spikes through contrastive learning based on the maximizing mutual information loss function as a theoretical basis. Based on the fact that data with similar features share the same labels whether they are multi-classified or binary-classified. With this theoretical support, we simplify the multi-classification problem into multiple binary-classification, improving both the accuracy and the runtime efficiency. Moreover, we also introduce a series of enhancements for the spikes, while solving the problem that the classification effect is affected because of the overlapping spikes. △ Less

Submitted 24 May, 2022; originally announced May 2022.

arXiv:2204.09119 [pdf]

doi 10.1038/s41377-022-01004-2

Optically-generated focused ultrasound for noninvasive brain stimulation with ultrahigh precision

Authors: Yueming Li, Ying Jiang, Lu Lan, Xiaowei Ge, Ran Cheng, Yuewei Zhan, Guo Chen, Linli Shi, Runyu Wang, Nan Zheng, Chen Yang, Ji-Xin Cheng

Abstract: High precision neuromodulation is a powerful tool to decipher neurocircuits and treat neurological diseases. Current non-invasive neuromodulation methods offer limited precision at the millimeter level. Here, we report optically-generated focused ultrasound (OFUS) for non-invasive brain stimulation with ultrahigh precision. OFUS is generated by a soft optoacoustic pad (SOAP) fabricated through emb… ▽ More High precision neuromodulation is a powerful tool to decipher neurocircuits and treat neurological diseases. Current non-invasive neuromodulation methods offer limited precision at the millimeter level. Here, we report optically-generated focused ultrasound (OFUS) for non-invasive brain stimulation with ultrahigh precision. OFUS is generated by a soft optoacoustic pad (SOAP) fabricated through embedding candle soot nanoparticles in a curved polydimethylsiloxane film. SOAP generates a transcranial ultrasound focus at 15 MHz with an ultrahigh lateral resolution of 83 um, which is two orders of magnitude smaller than that of conventional transcranial-focused ultrasound (tFUS). Here, we show effective OFUS neurostimulation in vitro with a single ultrasound cycle. We demonstrate submillimeter transcranial stimulation of the mouse motor cortex in vivo. An acoustic energy of 0.6 mJ/cm^2, four orders of magnitude less than that of tFUS, is sufficient for successful OFUS neurostimulation. OFUS offers new capabilities for neuroscience studies and disease treatments by delivering a focus with ultrahigh precision non-invasively. △ Less

Submitted 3 November, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

Comments: 36 pages, 5 main figures, 13 supplementary figures

Journal ref: Light Sci Appl 11, 321 (2022)

arXiv:2204.07054 [pdf, other]

doi 10.1109/TMI.2022.3218745

BrainGB: A Benchmark for Brain Network Analysis with Graph Neural Networks

Authors: Hejie Cui, Wei Dai, Yanqiao Zhu, Xuan Kan, Antonio Aodong Chen Gu, Joshua Lukemire, Liang Zhan, Lifang He, Ying Guo, Carl Yang

Abstract: Mapping the connectome of the human brain using structural or functional connectivity has become one of the most pervasive paradigms for neuroimaging analysis. Recently, Graph Neural Networks (GNNs) motivated from geometric deep learning have attracted broad interest due to their established power for modeling complex networked data. Despite their superior performance in many fields, there has not… ▽ More Mapping the connectome of the human brain using structural or functional connectivity has become one of the most pervasive paradigms for neuroimaging analysis. Recently, Graph Neural Networks (GNNs) motivated from geometric deep learning have attracted broad interest due to their established power for modeling complex networked data. Despite their superior performance in many fields, there has not yet been a systematic study of how to design effective GNNs for brain network analysis. To bridge this gap, we present BrainGB, a benchmark for brain network analysis with GNNs. BrainGB standardizes the process by (1) summarizing brain network construction pipelines for both functional and structural neuroimaging modalities and (2) modularizing the implementation of GNN designs. We conduct extensive experiments on datasets across cohorts and modalities and recommend a set of general recipes for effective GNN designs on brain networks. To support open and reproducible research on GNN-based brain network analysis, we host the BrainGB website at https://braingb.us with models, tutorials, examples, as well as an out-of-box Python package. We hope that this work will provide useful empirical evidence and offer insights for future research in this novel and promising direction. △ Less

Submitted 28 November, 2022; v1 submitted 17 March, 2022; originally announced April 2022.

Comments: IEEE Transactions on Medical Imaging

arXiv:2204.03408 [pdf, other]

Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces

Authors: Simon Dahan, Hao Xu, Logan Z. J. Williams, Abdulah Fawaz, Chunhui Yang, Timothy S. Coalson, Michelle C. Williams, David E. Newby, A. David Edwards, Matthew F. Glasser, Alistair A. Young, Daniel Rueckert, Emma C. Robinson

Abstract: Recent state-of-the-art performances of Vision Transformers (ViT) in computer vision tasks demonstrate that a general-purpose architecture, which implements long-range self-attention, could replace the local feature learning operations of convolutional neural networks. In this paper, we extend ViTs to surfaces by reformulating the task of surface learning as a sequence-to-sequence learning problem… ▽ More Recent state-of-the-art performances of Vision Transformers (ViT) in computer vision tasks demonstrate that a general-purpose architecture, which implements long-range self-attention, could replace the local feature learning operations of convolutional neural networks. In this paper, we extend ViTs to surfaces by reformulating the task of surface learning as a sequence-to-sequence learning problem, by proposing patching mechanisms for general surface meshes. Sequences of patches are then processed by a transformer encoder and used for classification or regression. We validate our method on a range of different biomedical surface domains and tasks: brain age prediction in the developing Human Connectome Project (dHCP), fluid intelligence prediction in the Human Connectome Project (HCP), and coronary artery calcium score classification using surfaces from the Scottish Computed Tomography of the Heart (SCOT-HEART) dataset, and investigate the impact of pretraining and data augmentation on model performance. Results suggest that Surface Vision Transformers (SiT) demonstrate consistent improvement over geometric deep learning methods for brain age and fluid intelligence prediction and achieve comparable performance on calcium score classification to standard metrics used in clinical practice. Furthermore, analysis of transformer attention maps offers clear and individualised predictions of the features driving each task. Code is available on Github: https://github.com/metrics-lab/surface-vision-transformers △ Less

Submitted 7 April, 2022; originally announced April 2022.

Comments: 10 pages, 3 figures, Submitted to IEEE Transactions on Medical Imaging

arXiv:2203.16414 [pdf, other]

Surface Vision Transformers: Attention-Based Modelling applied to Cortical Analysis

Authors: Simon Dahan, Abdulah Fawaz, Logan Z. J. Williams, Chunhui Yang, Timothy S. Coalson, Matthew F. Glasser, A. David Edwards, Daniel Rueckert, Emma C. Robinson

Abstract: The extension of convolutional neural networks (CNNs) to non-Euclidean geometries has led to multiple frameworks for studying manifolds. Many of those methods have shown design limitations resulting in poor modelling of long-range associations, as the generalisation of convolutions to irregular surfaces is non-trivial. Motivated by the success of attention-modelling in computer vision, we translat… ▽ More The extension of convolutional neural networks (CNNs) to non-Euclidean geometries has led to multiple frameworks for studying manifolds. Many of those methods have shown design limitations resulting in poor modelling of long-range associations, as the generalisation of convolutions to irregular surfaces is non-trivial. Motivated by the success of attention-modelling in computer vision, we translate convolution-free vision transformer approaches to surface data, to introduce a domain-agnostic architecture to study any surface data projected onto a spherical manifold. Here, surface patching is achieved by representing spherical data as a sequence of triangular patches, extracted from a subdivided icosphere. A transformer model encodes the sequence of patches via successive multi-head self-attention layers while preserving the sequence resolution. We validate the performance of the proposed Surface Vision Transformer (SiT) on the task of phenotype regression from cortical surface metrics derived from the Developing Human Connectome Project (dHCP). Experiments show that the SiT generally outperforms surface CNNs, while performing comparably on registered and unregistered data. Analysis of transformer attention maps offers strong potential to characterise subtle cognitive developmental patterns. △ Less

Submitted 30 March, 2022; originally announced March 2022.

Comments: 22 pages, 6 figures, Accepted to MIDL 2022, OpenReview link https://openreview.net/forum?id=mpp843Bsf-

Journal ref: Proceedings of Machine Learning Research. 172 (2022) 282-303

arXiv:2203.15804 [pdf]

Improving The Diagnosis of Thyroid Cancer by Machine Learning and Clinical Data

Authors: Nan Miles Xi, Lin Wang, Chuanjia Yang

Abstract: Thyroid cancer is a common endocrine carcinoma that occurs in the thyroid gland. Much effort has been invested in improving its diagnosis, and thyroidectomy remains the primary treatment method. A successful operation without unnecessary side injuries relies on an accurate preoperative diagnosis. Current human assessment of thyroid nodule malignancy is prone to errors and may not guarantee an accu… ▽ More Thyroid cancer is a common endocrine carcinoma that occurs in the thyroid gland. Much effort has been invested in improving its diagnosis, and thyroidectomy remains the primary treatment method. A successful operation without unnecessary side injuries relies on an accurate preoperative diagnosis. Current human assessment of thyroid nodule malignancy is prone to errors and may not guarantee an accurate preoperative diagnosis. This study proposed a machine framework to predict thyroid nodule malignancy based on a novel clinical dataset we collected. The 10-fold cross-validation, bootstrap analysis, and permutation predictor importance were applied to estimate and interpret the model performance under uncertainty. The comparison between model prediction and expert assessment shows the advantage of our framework over human judgment in predicting thyroid nodule malignancy. Our method is accurate, interpretable, and thus useable as additional evidence in the preoperative diagnosis for thyroid cancer. △ Less

Submitted 27 March, 2022; originally announced March 2022.

arXiv:2201.03551 [pdf, ps, other]

A model-based assessment of the cost-benefit balance and the plea bargain in criminality -- A qualitative case study of the Covid-19 epidemic shedding light on the "car wash operation" in Brazil

Authors: Hyun Mo Yang, Ariana Campos Yang, Silvia Martorano Raimundo

Abstract: We developed a simple mathematical model to describe criminality and the justice system composed of the police investigation and court trial. The model assessed two features of organized crime -- the cost-benefit analysis done by the crime-susceptible to commit a crime and the whistleblowing of the law offenders. The model was formulated considering the mass action law commonly used in the disease… ▽ More We developed a simple mathematical model to describe criminality and the justice system composed of the police investigation and court trial. The model assessed two features of organized crime -- the cost-benefit analysis done by the crime-susceptible to commit a crime and the whistleblowing of the law offenders. The model was formulated considering the mass action law commonly used in the disease propagation modelings, which can shed light on the model's analysis. The crime-susceptible individuals analyze two opposing forces -- committing crime influenced by the law offenders not caught by police neither imprisonment by the court trial (benefit of enjoying the corruption incoming), and the refraction to commit crime influenced by those caught by police or condemned by a court (cost of incarceration). Moreover, we assessed the dilemma for those captured by police investigation to participate in the rewarding whistleblowing program. The model was applied to analyze the "car wash operation" against corruption in Brazil. The model analysis showed that the cost-benefit analysis of crime-susceptible individuals whether the act of bribery is worth or not determined the basic crime reproduction number (threshold); however, the rewarding whistleblowing policies improved the combat to corruption arising a sub-threshold. Some adopted mechanisms to control the Covid-19 pandemic shed light on understanding the "car wash peration" and threatens to the fight against corruption. Appropriate coverage of corruption by media, enhancement of laws against white-collar crimes, well-functioning police investigation and court trial, and the rewarding whistleblowing policies inhibited and decreased the corruption. △ Less

Submitted 22 January, 2022; v1 submitted 9 January, 2022; originally announced January 2022.

arXiv:2111.05315 [pdf]

Stain-free Detection of Embryo Polarization using Deep Learning

Authors: Cheng Shen, Adiyant Lamba, Meng Zhu, Ray Zhang, Changhuei Yang, Magdalena Zernicka Goetz

Abstract: Polarization of the mammalian embryo at the right developmental time is critical for its development to term and would be valuable in assessing the potential of human embryos. However, tracking polarization requires invasive fluorescence staining, impermissible in the in vitro fertilization clinic. Here, we report the use of artificial intelligence to detect polarization from unstained time-lapse… ▽ More Polarization of the mammalian embryo at the right developmental time is critical for its development to term and would be valuable in assessing the potential of human embryos. However, tracking polarization requires invasive fluorescence staining, impermissible in the in vitro fertilization clinic. Here, we report the use of artificial intelligence to detect polarization from unstained time-lapse movies of mouse embryos. We assembled a dataset of bright-field movie frames from 8-cell-stage embryos, side-by-side with corresponding images of fluorescent markers of cell polarization. We then used an ensemble learning model to detect whether any bright-field frame showed an embryo before or after onset of polarization. Our resulting model has an accuracy of 85% for detecting polarization, significantly outperforming human volunteers trained on the same data (61% accuracy). We discovered that our self-learning model focuses upon the angle between cells as one known cue for compaction, which precedes polarization, but it outperforms the use of this cue alone. By compressing three-dimensional time-lapsed image data into two-dimensions, we are able to reduce data to an easily manageable size for deep learning processing. In conclusion, we describe a method for detecting a key developmental feature of embryo development that avoids clinically impermissible fluorescence staining. △ Less

Submitted 8 November, 2021; originally announced November 2021.

arXiv:2109.00809 [pdf]

A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data

Authors: Chao Yang, Debajyoti Chowdhury, Zhenmiao Zhang, William K. Cheung, Aiping Lu, Zhao Xiang Bian, Lu Zhang

Abstract: Microbes are essentially yet convolutedly linked with human lives on the earth. They critically interfere in different physiological processes and thus influence overall health status. Studying microbial species is used to be constrained to those that can be cultured in the lab. But it excluded a huge portion of the microbiome that could not survive on lab conditions. In the past few years, the cu… ▽ More Microbes are essentially yet convolutedly linked with human lives on the earth. They critically interfere in different physiological processes and thus influence overall health status. Studying microbial species is used to be constrained to those that can be cultured in the lab. But it excluded a huge portion of the microbiome that could not survive on lab conditions. In the past few years, the culture-independent metagenomic sequencing enabled us to explore the complex microbial community coexisting within and on us. Metagenomics has equipped us with new avenues of investigating the microbiome, from studying a single species to a complex community in a dynamic ecosystem. Thus, identifying the involved microbes and their genomes becomes one of the core tasks in metagenomic sequencing. Metagenome-assembled genomes are groups of contigs with similar sequence characteristics from de novo assembly and could represent the microbial genomes from metagenomic sequencing. In this paper, we reviewed a spectrum of tools for producing and annotating metagenome-assembled genomes from metagenomic sequencing data and discussed their technical and biological perspectives. △ Less

Submitted 2 September, 2021; originally announced September 2021.

arXiv:2109.00321 [pdf, other]

Matching Theory and Evidence on Covid-19 using a Stochastic Network SIR Model

Authors: M. Hashem Pesaran, Cynthia Fan Yang

Abstract: This paper develops an individual-based stochastic network SIR model for the empirical analysis of the Covid-19 pandemic. It derives moment conditions for the number of infected and active cases for single as well as multigroup epidemic models. These moment conditions are used to investigate the identification and estimation of the transmission rates. The paper then proposes a method that jointly… ▽ More This paper develops an individual-based stochastic network SIR model for the empirical analysis of the Covid-19 pandemic. It derives moment conditions for the number of infected and active cases for single as well as multigroup epidemic models. These moment conditions are used to investigate the identification and estimation of the transmission rates. The paper then proposes a method that jointly estimates the transmission rate and the magnitude of under-reporting of infected cases. Empirical evidence on six European countries matches the simulated outcomes once the under-reporting of infected cases is addressed. It is estimated that the number of actual cases could be between 4 to 10 times higher than the reported numbers in October 2020 and declined to 2 to 3 times in April 2021. The calibrated models are used in the counterfactual analyses of the impact of social distancing and vaccination on the epidemic evolution, and the timing of early interventions in the UK and Germany. △ Less

Submitted 4 January, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

arXiv:2107.05097 [pdf, other]

BrainNNExplainer: An Interpretable Graph Neural Network Framework for Brain Network based Disease Analysis

Authors: Hejie Cui, Wei Dai, Yanqiao Zhu, Xiaoxiao Li, Lifang He, Carl Yang

Abstract: Interpretable brain network models for disease prediction are of great value for the advancement of neuroscience. GNNs are promising to model complicated network data, but they are prone to overfitting and suffer from poor interpretability, which prevents their usage in decision-critical scenarios like healthcare. To bridge this gap, we propose BrainNNExplainer, an interpretable GNN framework for… ▽ More Interpretable brain network models for disease prediction are of great value for the advancement of neuroscience. GNNs are promising to model complicated network data, but they are prone to overfitting and suffer from poor interpretability, which prevents their usage in decision-critical scenarios like healthcare. To bridge this gap, we propose BrainNNExplainer, an interpretable GNN framework for brain network analysis. It is mainly composed of two jointly learned modules: a backbone prediction model that is specifically designed for brain networks and an explanation generator that highlights disease-specific prominent brain network connections. Extensive experimental results with visualizations on two challenging disease prediction datasets demonstrate the unique interpretability and outstanding performance of BrainNNExplainer. △ Less

Submitted 11 July, 2021; originally announced July 2021.

Comments: This paper has been accepted to ICML 2021 Workshop on Interpretable Machine Learning in Healthcare

MSC Class: 68T07; 68T45; 68T20 ACM Class: I.2.6; I.2.10; J.3

arXiv:2107.03220 [pdf, other]

Joint Embedding of Structural and Functional Brain Networks with Graph Neural Networks for Mental Illness Diagnosis

Authors: Yanqiao Zhu, Hejie Cui, Lifang He, Lichao Sun, Carl Yang

Abstract: Multimodal brain networks characterize complex connectivities among different brain regions from both structural and functional aspects and provide a new means for mental disease analysis. Recently, Graph Neural Networks (GNNs) have become a de facto model for analyzing graph-structured data. However, how to employ GNNs to extract effective representations from brain networks in multiple modalitie… ▽ More Multimodal brain networks characterize complex connectivities among different brain regions from both structural and functional aspects and provide a new means for mental disease analysis. Recently, Graph Neural Networks (GNNs) have become a de facto model for analyzing graph-structured data. However, how to employ GNNs to extract effective representations from brain networks in multiple modalities remains rarely explored. Moreover, as brain networks provide no initial node features, how to design informative node attributes and leverage edge weights for GNNs to learn is left unsolved. To this end, we develop a novel multiview GNN for multimodal brain networks. In particular, we regard each modality as a view for brain networks and employ contrastive learning for multimodal fusion. Then, we propose a GNN model which takes advantage of the message passing scheme by propagating messages based on degree statistics and brain region connectivities. Extensive experiments on two real-world disease datasets (HIV and Bipolar) demonstrate the effectiveness of our proposed method over state-of-the-art baselines. △ Less

Submitted 24 May, 2022; v1 submitted 7 July, 2021; originally announced July 2021.

Comments: Formal version accepted to IEEE EMBC 2022; previously presented at ICML 2021 Workshop on Computational Approaches to Mental Health (no proceedings)

arXiv:2106.14362 [pdf]

Photoacoustic Silk Scaffolds for Neural stimulation and Regeneration

Authors: Nan Zheng, Vincent Fitzpatrick, Ran Cheng, Linli Shi, David L. Kaplan, Chen Yang

Abstract: Neural interfaces using biocompatible scaffolds provide crucial properties for the functional repair of nerve injuries and neurodegenerative diseases, including cell adhesion, structural support, and mass transport. Neural stimulation has also been found to be effective in promoting neural regeneration. This work provides a new strategy to integrate photoacoustic (PA) neural stimulation into hydro… ▽ More Neural interfaces using biocompatible scaffolds provide crucial properties for the functional repair of nerve injuries and neurodegenerative diseases, including cell adhesion, structural support, and mass transport. Neural stimulation has also been found to be effective in promoting neural regeneration. This work provides a new strategy to integrate photoacoustic (PA) neural stimulation into hydrogel scaffolds using a nanocomposite hydrogel approach. Specifically, polyethylene glycol (PEG)-functionalized carbon nanotubes (CNT), highly efficient photoacoustic agents, are embedded into silk fibroin to form biocompatible and soft photoacoustic materials. We show that these photoacoustic functional scaffolds enable non-genetic activation of neurons with a spatial precision defined by the area of light illumination, promoting neuron regeneration. These CNT/silk scaffolds offered reliable and repeatable photoacoustic neural stimulation. 94% of photoacoustic stimulated neurons exhibit a fluorescence change larger than 10% in calcium imaging in the light illuminated area. The on-demand photoacoustic stimulation increased neurite outgrowth by 1.74-fold in a dorsal root ganglion model, when compared to the unstimulated group. We also confirmed that photoacoustic neural stimulation promoted neurite outgrowth by impacting the brain-derived neurotrophic factor (BDNF) pathway. As a multifunctional neural scaffold, CNT/silk scaffolds demonstrated non-genetic PA neural stimulation functions and promoted neurite outgrowth, providing a new method for non-pharmacological neural regeneration. △ Less

Submitted 27 June, 2021; originally announced June 2021.

arXiv:2104.00770 [pdf, other]

Dynamical-System Model Predicts When Social Learners Impair Collective Performance

Authors: Vicky Chuqiao Yang, Mirta Galesic, Harvey McGuinness, Ani Harutyunyan

Abstract: A key question concerning collective decisions is whether a social system can settle on the best available option when some members learn from others instead of evaluating the options on their own. This question is challenging to study, and previous research has reached mixed conclusions, because collective decision outcomes depend on the insufficiently understood complex system of cognitive strat… ▽ More A key question concerning collective decisions is whether a social system can settle on the best available option when some members learn from others instead of evaluating the options on their own. This question is challenging to study, and previous research has reached mixed conclusions, because collective decision outcomes depend on the insufficiently understood complex system of cognitive strategies, task properties, and social influence processes. This study integrates these complex interactions together in one general yet partially analytically tractable mathematical framework using a dynamical system model. In particular, it investigates how the interplay of the proportion of social learners, the relative merit of options, and the type of conformity response affect collective decision outcomes in a binary choice. The model predicts that when the proportion of social learners exceeds a critical threshold, a bi-stable state appears in which the majority can end up favoring either the higher- or lower-merit option, depending on fluctuations and initial conditions. Below this threshold, the high-merit option is chosen by the majority. The critical threshold is determined by the conformity response function and the relative merits of the two options. The study helps reconcile disagreements about the effect of social learners on collective performance and proposes a mathematical framework that can be readily adapted to extensions investigating a wider variety of dynamics. △ Less

Submitted 20 August, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

Comments: 10 pages, 3 figures

arXiv:2012.09930 [pdf]

Non-genetic acoustic stimulation of single neurons by a tapered fiber optoacoustic emitter

Authors: Linli Shi, Ying Jiang, Fernando R. Fernandez, Lu Lan, Guo Chen, Heng-ye Man, John A. White, Ji-Xin Cheng, Chen Yang

Abstract: As an emerging technology, transcranial focused ultrasound has been demonstrated to successfully evoke motor responses in mice, rabbits, and sensory/motor responses in humans. Yet, the spatial resolution of ultrasound does not allow for high-precision stimulation. Here, we developed a tapered fiber optoacoustic emitter (TFOE) for optoacoustic stimulation of neurons with an unprecedented spatial re… ▽ More As an emerging technology, transcranial focused ultrasound has been demonstrated to successfully evoke motor responses in mice, rabbits, and sensory/motor responses in humans. Yet, the spatial resolution of ultrasound does not allow for high-precision stimulation. Here, we developed a tapered fiber optoacoustic emitter (TFOE) for optoacoustic stimulation of neurons with an unprecedented spatial resolution of 20 microns, enabling selective activation of single neurons or subcellular structures, such as axons and dendrites. A single acoustic pulse of 1 microsecond converted by the TFOE from a single laser pulse of 3 nanoseconds is shown as the shortest acoustic stimuli so far for successful neuron activation. The highly localized ultrasound generated by the TFOE made it possible to integrate the optoacoustic stimulation and highly stable patch clamp recording on single neurons. Direct measurements of electrical response of single neurons to acoustic stimulation, which is difficult for conventional ultrasound stimulation, have been demonstrated for the first time. By coupling TFOE with ex vivo brain slice electrophysiology, we unveil cell-type-specific response of excitatory and inhibitory neurons to acoustic stimulation. These results demonstrate that TFOE is a non-genetic single-cell and sub-cellular modulation technology, which could shed new insights into the mechanism of neurostimulation. △ Less

Submitted 17 December, 2020; originally announced December 2020.

Comments: 25 pages, 5 figures

arXiv:2012.00672 [pdf]

Dynamics-based peptide-MHC binding optimization by a convolutional variational autoencoder: a use-case model for CASTELO

Authors: David Bell, Giacomo Domeniconi, Chih-Chieh Yang, Ruhong Zhou, Leili Zhang, Guojing Cong

Abstract: An unsolved challenge in the development of antigen specific immunotherapies is determining the optimal antigens to target. Comprehension of antigen-MHC binding is paramount towards achieving this goal. Here, we present CASTELO, a combined machine learning-molecular dynamics (ML-MD) approach to design novel antigens of increased MHC binding affinity for a Type 1 diabetes (T1D)-implicated system. W… ▽ More An unsolved challenge in the development of antigen specific immunotherapies is determining the optimal antigens to target. Comprehension of antigen-MHC binding is paramount towards achieving this goal. Here, we present CASTELO, a combined machine learning-molecular dynamics (ML-MD) approach to design novel antigens of increased MHC binding affinity for a Type 1 diabetes (T1D)-implicated system. We build upon a small molecule lead optimization algorithm by training a convolutional variational autoencoder (CVAE) on MD trajectories of 48 different systems across 4 antigens and 4 HLA serotypes. We develop several new machine learning metrics including a structure-based anchor residue classification model as well as cluster comparison scores. ML-MD predictions agree well with experimental binding results and free energy perturbation-predicted binding affinities. Moreover, ML-MD metrics are independent of traditional MD stability metrics such as contact area and RMSF, which do not reflect binding affinity data. Our work supports the role of structure-based deep learning techniques in antigen specific immunotherapy design. △ Less

Submitted 8 December, 2020; v1 submitted 29 November, 2020; originally announced December 2020.

arXiv:2011.01508 [pdf, other]

Greetings from a Triparental Planet

Authors: Gizem Bacaksizlar, Stefani Crabtree, Joshua Garland, Natalie Grefenstette, Albert Kao, David Kinney, Artemy Kolchinsky, Tyler Marghetis, Michael Price, Maria Riolo, Hajime Shimao, Ashley Teufel, Tamara van der Does, Vicky Chuqiao Yang

Abstract: In this work of speculative science, scientists from a distant star system explain the emergence and consequences of triparentalism, when three individuals are required for sexual reproduction, which is the standard form of mating on their home world. The report details the evolution of their reproductive system--that is, the conditions under which triparentalism and three self-avoiding mating typ… ▽ More In this work of speculative science, scientists from a distant star system explain the emergence and consequences of triparentalism, when three individuals are required for sexual reproduction, which is the standard form of mating on their home world. The report details the evolution of their reproductive system--that is, the conditions under which triparentalism and three self-avoiding mating types emerged as advantageous strategies for sexual reproduction. It also provides an overview of the biological consequences of triparental reproduction with three mating types, including the genetic mechanisms of triparental reproduction, asymmetries between the three mating types, and infection dynamics arising from their different mode of sexual reproduction. The report finishes by discussing how central aspects of their society, such as short-lasting unions among individuals and the rise of a monoculture, might have arisen as a result of their triparental system. △ Less

Submitted 3 November, 2020; originally announced November 2020.

Comments: The original version of this report was produced by a team in just 72 hours. This version includes additional edits for style and formatting

arXiv:2008.05661 [pdf, other]

doi 10.1137/19M1254246

Why are U.S. Parties So Polarized? A "Satisficing" Dynamical Model

Authors: Vicky Chuqiao Yang, Daniel M. Abrams, Georgia Kernell, Adilson E. Motter

Abstract: Since the 1960s, Democrats and Republicans in U.S. Congress have taken increasingly polarized positions, while the public's policy positions have remained centrist and moderate. We explain this apparent contradiction by developing a dynamical model that predicts ideological positions of political parties. Our approach tackles the challenge of incorporating bounded rationality into mathematical mod… ▽ More Since the 1960s, Democrats and Republicans in U.S. Congress have taken increasingly polarized positions, while the public's policy positions have remained centrist and moderate. We explain this apparent contradiction by developing a dynamical model that predicts ideological positions of political parties. Our approach tackles the challenge of incorporating bounded rationality into mathematical models and integrates the empirical finding of satisficing decision making---voters settle for candidates who are "good enough" when deciding for whom to vote. We test the model using data from the U.S. Congress over the past 150 years, and find that our predictions are consistent with the two major political parties' historical trajectory. In particular, the model explains how polarization between the Democrats and Republicans since the 1960s could be a consequence of increasing ideological homogeneity within the parties. △ Less

Submitted 14 August, 2020; v1 submitted 12 August, 2020; originally announced August 2020.

Comments: 13 pages, 5 figures

Journal ref: SIAM Review, 2020, 62(3), 646-657

arXiv:2005.04224 [pdf]

doi 10.1080/17538947.2020.1809723

Taking the pulse of COVID-19: A spatiotemporal perspective

Authors: Chaowei Yang, Dexuan Sha, Qian Liu, Yun Li, Hai Lan, Weihe Wendy Guan, Tao Hu, Zhenlong Li, Zhiran Zhang, John Hoot Thompson, Zifu Wang, David Wong, Shiyang Ruan, Manzhu Yu, Douglas Richardson, Luyao Zhang, Ruizhi Hou, You Zhou, Cheng Zhong, Yifei Tian, Fayez Beaini, Kyla Carte, Colin Flynn, Wei Liu, Dieter Pfoser , et al. (10 additional authors not shown)

Abstract: The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the… ▽ More The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the pandemic by the end of March. In response to this national and global emergency, the NSF Spatiotemporal Innovation Center brought together a taskforce of international researchers and assembled implemented strategies to rapidly respond to this crisis, for supporting research, saving lives, and protecting the health of global citizens. This perspective paper presents our collective view on the global health emergency and our effort in collecting, analyzing, and sharing relevant data on global policy and government responses, geospatial indicators of the outbreak and evolving forecasts; in developing research capabilities and mitigation measures with global scientists, promoting collaborative research on outbreak dynamics, and reflecting on the dynamic responses from human societies. △ Less

Submitted 8 May, 2020; originally announced May 2020.

Comments: 27 pages, 18 figures. International Journal of Digital Earth (2020)

arXiv:2004.05715 [pdf, other]

Modeling the transmission of new coronavirus in São Paulo State, Brazil -- Assessing epidemiological impacts of isolating young and elder persons

Authors: Hyun Mo Yang, Luis Pedro Lombardi Junior, Ariana Campos Yang

Abstract: We developed a mathematical model to describe the transmission of new coronavirus in the São Paulo State, Brazil. The model divided a community in subpopulations comprised by young and elder persons, in order to take into account higher risk of fatality among elder persons with severe CoViD-19. From data collected in the São Paulo State, we estimated the transmission and additional mortality rates… ▽ More We developed a mathematical model to describe the transmission of new coronavirus in the São Paulo State, Brazil. The model divided a community in subpopulations comprised by young and elder persons, in order to take into account higher risk of fatality among elder persons with severe CoViD-19. From data collected in the São Paulo State, we estimated the transmission and additional mortality rates, from which we calculated the basic reproduction number R0. From estimated parameters, estimation of the deaths due to CoViD-19 was three times lower than those found in literature. Considering isolation as a control mechanism, we varied isolation rates of young and elder persons in order to assess their epidemiological impacts. The epidemiological scenarios focused mainly on evaluating the number of severe CoViD-19 cases and deaths due to this disease when isolation is introduced in a population. △ Less

Submitted 12 April, 2020; originally announced April 2020.

Comments: 33 pages, 25 figures

arXiv:2002.00539 [pdf, other]

doi 10.1109/CEC48606.2020.9185648

Evolving Neural Networks through a Reverse Encoding Tree

Authors: Haoling Zhang, Chao-Han Huck Yang, Hector Zenil, Narsis A. Kiani, Yue Shen, Jesper N. Tegner

Abstract: NeuroEvolution is one of the most competitive evolutionary learning frameworks for designing novel neural networks for use in specific tasks, such as logic circuit design and digital gaming. However, the application of benchmark methods such as the NeuroEvolution of Augmenting Topologies (NEAT) remains a challenge, in terms of their computational cost and search time inefficiency. This paper advan… ▽ More NeuroEvolution is one of the most competitive evolutionary learning frameworks for designing novel neural networks for use in specific tasks, such as logic circuit design and digital gaming. However, the application of benchmark methods such as the NeuroEvolution of Augmenting Topologies (NEAT) remains a challenge, in terms of their computational cost and search time inefficiency. This paper advances a method which incorporates a type of topological edge coding, named Reverse Encoding Tree (RET), for evolving scalable neural networks efficiently. Using RET, two types of approaches -- NEAT with Binary search encoding (Bi-NEAT) and NEAT with Golden-Section search encoding (GS-NEAT) -- have been designed to solve problems in benchmark continuous learning environments such as logic gates, Cartpole, and Lunar Lander, and tested against classical NEAT and FS-NEAT as baselines. Additionally, we conduct a robustness test to evaluate the resilience of the proposed NEAT algorithms. The results show that the two proposed strategies deliver improved performance, characterized by (1) a higher accumulated reward within a finite number of time steps; (2) using fewer episodes to solve problems in targeted environments, and (3) maintaining adaptive robustness under noisy perturbations, which outperform the baselines in all tested cases. Our analysis also demonstrates that RET expends potential future research directions in dynamic environments. Code is available from https://github.com/HaolingZHANG/ReverseEncodingTree. △ Less

Submitted 31 March, 2020; v1 submitted 2 February, 2020; originally announced February 2020.

Comments: Accepted to IEEE Congress on Evolutionary Computation (IEEE CEC) 2020. Lecture Presentation

Journal ref: 2020 IEEE Congress on Evolutionary Computation (CEC)

arXiv:1911.10419 [pdf, other]

Falling Through the Cracks: Modeling the Formation of Social Category Boundaries

Authors: Vicky Chuqiao Yang, Tamara van der Does, Henrik Olsson

Abstract: Social categorizations divide people into "us" and "them," often along continuous attributes such as political ideology or skin color. This division results in both positive consequences, such as a sense of community, and negative ones, such as group conflict. Further, individuals in the middle of the spectrum can fall through the cracks of this categorization process and are seen as out-group by… ▽ More Social categorizations divide people into "us" and "them," often along continuous attributes such as political ideology or skin color. This division results in both positive consequences, such as a sense of community, and negative ones, such as group conflict. Further, individuals in the middle of the spectrum can fall through the cracks of this categorization process and are seen as out-group by individuals on either side of the spectrum, becoming inbetweeners. Here, we propose a quantitative, dynamical-system model that studies the joint influence of cognitive and social processes. We model where two social groups draw the boundaries between "us" and "them" on a continuous attribute. Our model predicts that both groups tend to draw a more restrictive boundary than the middle of the spectrum. As a result, each group sees the individuals in the middle of the attribute space as an out-group. We test this prediction using U.S. political survey data on how political independents are perceived by registered party members as well as existing experiments on the perception of racially ambiguous faces, and find support. △ Less

Submitted 26 February, 2021; v1 submitted 23 November, 2019; originally announced November 2019.

Comments: 16 pages, 6 figures

MSC Class: 91C99; 37N99

arXiv:1811.05592 [pdf]

Controllability, Multiplexing, and Transfer Learning in Networks using Evolutionary Learning

Authors: Rise Ooi, Chao-Han Huck Yang, Pin-Yu Chen, Vìctor Eguìluz, Narsis Kiani, Hector Zenil, David Gomez-Cabrero, Jesper Tegnèr

Abstract: Networks are fundamental building blocks for representing data, and computations. Remarkable progress in learning in structurally defined (shallow or deep) networks has recently been achieved. Here we introduce evolutionary exploratory search and learning method of topologically flexible networks under the constraint of producing elementary computational steady-state input-output operations. Our… ▽ More Networks are fundamental building blocks for representing data, and computations. Remarkable progress in learning in structurally defined (shallow or deep) networks has recently been achieved. Here we introduce evolutionary exploratory search and learning method of topologically flexible networks under the constraint of producing elementary computational steady-state input-output operations. Our results include; (1) the identification of networks, over four orders of magnitude, implementing computation of steady-state input-output functions, such as a band-pass filter, a threshold function, and an inverse band-pass function. Next, (2) the learned networks are technically controllable as only a small number of driver nodes are required to move the system to a new state. Furthermore, we find that the fraction of required driver nodes is constant during evolutionary learning, suggesting a stable system design. (3), our framework allows multiplexing of different computations using the same network. For example, using a binary representation of the inputs, the network can readily compute three different input-output functions. Finally, (4) the proposed evolutionary learning demonstrates transfer learning. If the system learns one function A, then learning B requires on average less number of steps as compared to learning B from tabula rasa. We conclude that the constrained evolutionary learning produces large robust controllable circuits, capable of multiplexing and transfer learning. Our study suggests that network-based computations of steady-state functions, representing either cellular modules of cell-to-cell communication networks or internal molecular circuits communicating within a cell, could be a powerful model for biologically inspired computing. This complements conceptualizations such as attractor based models, or reservoir computing. △ Less

Submitted 3 November, 2019; v1 submitted 13 November, 2018; originally announced November 2018.

Comments: A revised version. (word source code to pdf; owing to the algo package conflicts)

arXiv:1810.13041 [pdf, other]

doi 10.1039/C8SM02170H

Concurrent coupling of atomistic simulation and mesoscopic hydrodynamics for flows over soft multi-functional surfaces

Authors: Yuying Wang, Zhen Li, Junbo Xu, Chao Yang, George Em Karniadakis

Abstract: We develop an efficient parallel multiscale method that bridges the atomistic and mesoscale regimes, from nanometer to micron and beyond, via concurrent coupling of atomistic simulation and mesoscopic dynamics. In particular, we combine an all-atom molecular dynamics (MD) description for specific atomistic details in the vicinity of the functional surface, with a dissipative particle dynamics (DPD… ▽ More We develop an efficient parallel multiscale method that bridges the atomistic and mesoscale regimes, from nanometer to micron and beyond, via concurrent coupling of atomistic simulation and mesoscopic dynamics. In particular, we combine an all-atom molecular dynamics (MD) description for specific atomistic details in the vicinity of the functional surface, with a dissipative particle dynamics (DPD) approach that captures mesoscopic hydrodynamics in the domain away from the functional surface. In order to achieve a seamless transition in dynamic properties we endow the MD simulation with a DPD thermostat, which is validated against experimental results by modeling water at different temperatures. We then validate the MD-DPD coupling method for transient Couette and Poiseuille flows, demonstrating that the concurrent MD-DPD coupling can resolve accurately the continuum-based analytical solutions. Subsequently, we simulate shear flows over polydimethylsiloxane (PDMS)-grafted surfaces (polymer brushes) for various grafting densities, and investigate the slip flow as a function of the shear stress. We verify that a "universal" power law exists for the sliplength, in agreement with published results. Having validated the MD-DPD coupling method, we simulate time-dependent flows past an endothelial glycocalyx layer (EGL) in a microchannel. Coupled simulation results elucidate the dynamics of EGL changing from an equilibrium state to a compressed state under shear by aligning the molecular structures along the shear direction. MD-DPD simulation results agree well with results of a single MD simulation, but with the former more than two orders of magnitude faster than the latter for system sizes above one micron. △ Less

Submitted 27 October, 2018; originally announced October 2018.

Comments: 11 pages, 12 figures

Journal ref: Soft Matter, 2019,15: 1747-1757

arXiv:1809.06899 [pdf, other]

Testing Selective Influence Directly Using Trackball Movement Tasks

Authors: Ru Zhang, Cheng-Ta Yang, Janne V. Kujala

Abstract: Systems factorial technology (SFT; Townsend & Nozawa, 1995) is regarded as a useful tool to diagnose if features (or dimensions) of the investigated stimulus are processed in a parallel or serial fashion. In order to use SFT, one has to assume the speed to process each feature is influenced by that feature only, termed as selective influence (Sternberg, 1969). This assumption is usually untestable… ▽ More Systems factorial technology (SFT; Townsend & Nozawa, 1995) is regarded as a useful tool to diagnose if features (or dimensions) of the investigated stimulus are processed in a parallel or serial fashion. In order to use SFT, one has to assume the speed to process each feature is influenced by that feature only, termed as selective influence (Sternberg, 1969). This assumption is usually untestable as the processing time for a stimulus feature is not observable. Stochastic dominance is traditionally used as an indirect evidence for selective influence (e.g., Townsend & Fifić, 2004). However, one should keep in mind that selective influence may be violated even when stochastic dominance holds. The current study proposes a trackball movement paradigm for a direct test of selective influence. The participants were shown a reference stimulus and a test stimulus simultaneously on a computer screen. They were asked to use the trackball to adjust the test stimulus until it appeared to match the position or shape of the reference stimulus. We recorded the reaction time, the parameters defined the reference stimulus (denoted as αand β), and the parameters defined the test stimulus (denoted as A and B). We tested selective influence of αand βon the amount of time to adjust A and B through testing selective influence of αand βon the values of A and B using the linear feasibility test (Dzhafarov & Kujala, 2010). We found that when the test was passed and stochastic dominance held, the inferred architecture was as expected, which was further confirmed by the trajectory of A and B observed in each trial. However, with stochastic dominance only SFT can suggest a prohibited architecture. Our results indicate the proposed method is more reliable for testing selective influence on the processing speed than examining stochastic dominance only. △ Less

Submitted 18 September, 2018; originally announced September 2018.

arXiv:1804.11011 [pdf, other]

Joint Analysis of Individual-level and Summary-level GWAS Data by Leveraging Pleiotropy

Authors: Mingwei Dai, Xiang Wan, Hao Peng, Yao Wang, Yue Liu, Jin Liu, Zongben Xu, Can Yang

Abstract: A large number of recent genome-wide association studies (GWASs) for complex phenotypes confirm the early conjecture for polygenicity, suggesting the presence of large number of variants with only tiny or moderate effects. However, due to the limited sample size of a single GWAS, many associated genetic variants are too weak to achieve the genome-wide significance. These undiscovered variants furt… ▽ More A large number of recent genome-wide association studies (GWASs) for complex phenotypes confirm the early conjecture for polygenicity, suggesting the presence of large number of variants with only tiny or moderate effects. However, due to the limited sample size of a single GWAS, many associated genetic variants are too weak to achieve the genome-wide significance. These undiscovered variants further limit the prediction capability of GWAS. Restricted access to the individual-level data and the increasing availability of the published GWAS results motivate the development of methods integrating both the individual-level and summary-level data. How to build the connection between the individual-level and summary-level data determines the efficiency of using the existing abundant summary-level resources with limited individual-level data, and this issue inspires more efforts in the existing area. In this study, we propose a novel statistical approach, LEP, which provides a novel way of modeling the connection between the individual-level data and summary-level data. LEP integrates both types of data by \underline{LE}veraing \underline{P}leiotropy to increase the statistical power of risk variants identification and the accuracy of risk prediction. The algorithm for parameter estimation is developed to handle genome-wide-scale data. Through comprehensive simulation studies, we demonstrated the advantages of LEP over the existing methods. We further applied LEP to perform integrative analysis of Crohn's disease from WTCCC and summary statistics from GWAS of some other diseases, such as Type 1 diabetes, Ulcerative colitis and Primary biliary cirrhosis. LEP was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.39\% ($\pm$ 0.58\%) to 68.33\% ($\pm$ 0.32\%) using about 195,000 variants. △ Less

Submitted 29 April, 2018; originally announced April 2018.

Comments: 32 pages, 11 figures, 2 tables

arXiv:1712.07292 [pdf]

doi 10.1038/s41467-018-05845-7

Causal Decomposition in the Mutual Causation System

Authors: Albert C. Yang, Norden E. Huang, Chung-Kang Peng

Abstract: Inference of causality in time series has been principally based on the prediction paradigm. Nonetheless, the predictive causality approach may overlook the simultaneous and reciprocal nature of causal interactions observed in real world phenomena. Here, we present a causal decomposition approach that is not based on prediction, but based on the instantaneous phase dependency between the intrinsic… ▽ More Inference of causality in time series has been principally based on the prediction paradigm. Nonetheless, the predictive causality approach may overlook the simultaneous and reciprocal nature of causal interactions observed in real world phenomena. Here, we present a causal decomposition approach that is not based on prediction, but based on the instantaneous phase dependency between the intrinsic components of a decomposed time series. The method involves two assumptions: (1) any cause effect relationship can be quantified with instantaneous phase dependency between the source and target decomposed as intrinsic components at specific time scale, and (2) the phase dynamics in the target originating from the source are separable from the target itself. Using empirical mode decomposition, we show that the causal interaction is encoded in instantaneous phase dependency at a specific time scale, and this phase dependency is diminished when the causal-related intrinsic component is removed from the effect. Furthermore, we demonstrate the generic applicability of our method to both stochastic and deterministic systems, and show the consistency of the causal decomposition method compared to existing methods, and finally uncover the key mode of causal interactions in both the modelled and actual predator prey system. We anticipate that this novel approach will assist with revealing causal interactions in complex networks not accounted for by current methods. △ Less

Submitted 19 December, 2017; originally announced December 2017.

Comments: 5 figures

arXiv:1712.00476 [pdf, other]

doi 10.1103/PhysRevE.100.032306

Modeling the origin of urban output scaling laws

Authors: Vicky Chuqiao Yang, Andrew V. Papachristos, Daniel M. Abrams

Abstract: Urban outputs often scale superlinearly with city population. A difficulty in understanding the mechanism of this phenomenon is that different outputs differ considerably in their scaling behaviors. Here, we formulate a physics-based model for the origin of superlinear scaling in urban outputs by treating human interaction as a random process. Our model suggests that the increased likelihood of fi… ▽ More Urban outputs often scale superlinearly with city population. A difficulty in understanding the mechanism of this phenomenon is that different outputs differ considerably in their scaling behaviors. Here, we formulate a physics-based model for the origin of superlinear scaling in urban outputs by treating human interaction as a random process. Our model suggests that the increased likelihood of finding required collaborations in a larger population can explain this superlinear scaling, which our model predicts to be non-power-law. Moreover, the extent of superlinearity should be greater for activities that require more collaborators. We test this model using a novel dataset for seven crime types and find strong support. △ Less

Submitted 26 August, 2019; v1 submitted 1 December, 2017; originally announced December 2017.

Comments: 8 pages, 5 figures

Journal ref: Phys. Rev. E 100, 032306 (2019)

arXiv:1710.07201 [pdf, other]

LSMM: A statistical approach to integrating functional annotations with genome-wide association studies

Authors: Jingsi Ming, Mingwei Dai, Mingxuan Cai, Xiang Wan, Jin Liu, Can Yang

Abstract: Thousands of risk variants underlying complex phenotypes (quantitative traits and diseases) have been identified in genome-wide association studies (GWAS). However, there are still two major challenges towards deepening our understanding of the genetic architectures of complex phenotypes. First, the majority of GWAS hits are in the non-coding region and their biological interpretation is still unc… ▽ More Thousands of risk variants underlying complex phenotypes (quantitative traits and diseases) have been identified in genome-wide association studies (GWAS). However, there are still two major challenges towards deepening our understanding of the genetic architectures of complex phenotypes. First, the majority of GWAS hits are in the non-coding region and their biological interpretation is still unclear. Second, accumulating evidence from GWAS suggests the polygenicity of complex traits, i.e., a complex trait is often affected by many variants with small or moderate effects, whereas a large proportion of risk variants with small effects remains unknown. The availability of functional annotation data enables us to address the above challenges. In this study, we propose a latent sparse mixed model (LSMM) to integrate functional annotations with GWAS data. Not only does it increase statistical power of the identification of risk variants, but also offers more biological insights by detecting relevant functional annotations. To allow LSMM scalable to millions of variants and hundreds of functional annotations, we developed an efficient variational expectation-maximization (EM) algorithm for model parameter estimation and statistical inference. We first conducted comprehensive simulation studies to evaluate the performance of LSMM. Then we applied it to analyze 30 GWAS of complex phenotypes integrated with 9 genic category annotations and 127 tissue-specific functional annotations from the Roadmap project. The results demonstrate that our method possesses more statistical power over conventional methods, and can help researchers achieve deeper understanding of genetic architecture of these complex phenotypes. △ Less

Submitted 19 October, 2017; originally announced October 2017.

Showing 1–50 of 61 results for author: Yang, C