-
NeuroBind: Towards Unified Multimodal Representations for Neural Signals
Authors:
Fengyu Yang,
Chao Feng,
Daniel Wang,
Tianye Wang,
Ziyao Zeng,
Zhiyang Xu,
Hyoungseob Park,
Pengliang Ji,
Hanbin Zhao,
Yuanning Li,
Alex Wong
Abstract:
Understanding neural activity and information representation is crucial for advancing knowledge of brain function and cognition. Neural activity, measured through techniques like electrophysiology and neuroimaging, reflects various aspects of information processing. Recent advances in deep neural networks offer new approaches to analyzing these signals using pre-trained models. However, challenges…
▽ More
Understanding neural activity and information representation is crucial for advancing knowledge of brain function and cognition. Neural activity, measured through techniques like electrophysiology and neuroimaging, reflects various aspects of information processing. Recent advances in deep neural networks offer new approaches to analyzing these signals using pre-trained models. However, challenges arise due to discrepancies between different neural signal modalities and the limited scale of high-quality neural data. To address these challenges, we present NeuroBind, a general representation that unifies multiple brain signal types, including EEG, fMRI, calcium imaging, and spiking data. To achieve this, we align neural signals in these image-paired neural datasets to pre-trained vision-language embeddings. Neurobind is the first model that studies different neural modalities interconnectedly and is able to leverage high-resource modality models for various neuroscience tasks. We also showed that by combining information from different neural signal modalities, NeuroBind enhances downstream performance, demonstrating the effectiveness of the complementary strengths of different neural modalities. As a result, we can leverage multiple types of neural signals mapped to the same space to improve downstream tasks, and demonstrate the complementary strengths of different neural modalities. This approach holds significant potential for advancing neuroscience research, improving AI systems, and developing neuroprosthetics and brain-computer interfaces.
△ Less
Submitted 19 July, 2024;
originally announced July 2024.
-
Exploring Post-Training Quantization of Protein Language Models
Authors:
Shuang Peng,
Fei Yang,
Ning Sun,
Sheng Chen,
Yanfeng Jiang,
Aimin Pan
Abstract:
Recent advancements in unsupervised protein language models (ProteinLMs), like ESM-1b and ESM-2, have shown promise in different protein prediction tasks. However, these models face challenges due to their high computational demands, significant memory needs, and latency, restricting their usage on devices with limited resources. To tackle this, we explore post-training quantization (PTQ) for Prot…
▽ More
Recent advancements in unsupervised protein language models (ProteinLMs), like ESM-1b and ESM-2, have shown promise in different protein prediction tasks. However, these models face challenges due to their high computational demands, significant memory needs, and latency, restricting their usage on devices with limited resources. To tackle this, we explore post-training quantization (PTQ) for ProteinLMs, focusing on ESMFold, a simplified version of AlphaFold based on ESM-2 ProteinLM. Our study is the first attempt to quantize all weights and activations of ProteinLMs. We observed that the typical uniform quantization method performs poorly on ESMFold, causing a significant drop in TM-Score when using 8-bit quantization. We conducted extensive quantization experiments, uncovering unique challenges associated with ESMFold, particularly highly asymmetric activation ranges before Layer Normalization, making representation difficult using low-bit fixed-point formats. To address these challenges, we propose a new PTQ method for ProteinLMs, utilizing piecewise linear quantization for asymmetric activation values to ensure accurate approximation. We demonstrated the effectiveness of our method in protein structure prediction tasks, demonstrating that ESMFold can be accurately quantized to low-bit widths without compromising accuracy. Additionally, we applied our method to the contact prediction task, showcasing its versatility. In summary, our study introduces an innovative PTQ method for ProteinLMs, addressing specific quantization challenges and potentially leading to the development of more efficient ProteinLMs with significant implications for various protein-related applications.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
Exploring evolution-aware & -free protein language models as protein function predictors
Authors:
Mingyang Hu,
Fajie Yuan,
Kevin K. Yang,
Fusong Ju,
Jin Su,
Hui Wang,
Fei Yang,
Qiuyang Ding
Abstract:
Large-scale Protein Language Models (PLMs) have improved performance in protein prediction tasks, ranging from 3D structure prediction to various function predictions. In particular, AlphaFold, a ground-breaking AI system, could potentially reshape structural biology. However, the utility of the PLM module in AlphaFold, Evoformer, has not been explored beyond structure prediction. In this paper, w…
▽ More
Large-scale Protein Language Models (PLMs) have improved performance in protein prediction tasks, ranging from 3D structure prediction to various function predictions. In particular, AlphaFold, a ground-breaking AI system, could potentially reshape structural biology. However, the utility of the PLM module in AlphaFold, Evoformer, has not been explored beyond structure prediction. In this paper, we investigate the representation ability of three popular PLMs: ESM-1b (single sequence), MSA-Transformer (multiple sequence alignment) and Evoformer (structural), with a special focus on Evoformer. Specifically, we aim to answer the following key questions: (i) Does the Evoformer trained as part of AlphaFold produce representations amenable to predicting protein function? (ii) If yes, can Evoformer replace ESM-1b and MSA-Transformer? (ii) How much do these PLMs rely on evolution-related protein data? In this regard, are they complementary to each other? We compare these models by empirical study along with new insights and conclusions. All code and datasets for reproducibility are available at https://github.com/elttaes/Revisiting-PLMs.
△ Less
Submitted 16 October, 2022; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Matching Theory and Evidence on Covid-19 using a Stochastic Network SIR Model
Authors:
M. Hashem Pesaran,
Cynthia Fan Yang
Abstract:
This paper develops an individual-based stochastic network SIR model for the empirical analysis of the Covid-19 pandemic. It derives moment conditions for the number of infected and active cases for single as well as multigroup epidemic models. These moment conditions are used to investigate the identification and estimation of the transmission rates. The paper then proposes a method that jointly…
▽ More
This paper develops an individual-based stochastic network SIR model for the empirical analysis of the Covid-19 pandemic. It derives moment conditions for the number of infected and active cases for single as well as multigroup epidemic models. These moment conditions are used to investigate the identification and estimation of the transmission rates. The paper then proposes a method that jointly estimates the transmission rate and the magnitude of under-reporting of infected cases. Empirical evidence on six European countries matches the simulated outcomes once the under-reporting of infected cases is addressed. It is estimated that the number of actual cases could be between 4 to 10 times higher than the reported numbers in October 2020 and declined to 2 to 3 times in April 2021. The calibrated models are used in the counterfactual analyses of the impact of social distancing and vaccination on the epidemic evolution, and the timing of early interventions in the UK and Germany.
△ Less
Submitted 4 January, 2022; v1 submitted 1 September, 2021;
originally announced September 2021.
-
Analyzing Insect-Plant Predation Data By Bayesian Nonparametrics
Authors:
Fan Yang,
Takatomi Kubo,
Kazushi Ikeda
Abstract:
In the prospect of ecology and biology, studying insect-plant predation will considerably contribute to pest control, benefit agriculture and afforestation, and also help people to better understand insect-plant co-evolution. Therefore, we are motivated to do two work in this study. The first part is to cluster the insect-plant predation, in such manner, unobserved predation could be estimated. Th…
▽ More
In the prospect of ecology and biology, studying insect-plant predation will considerably contribute to pest control, benefit agriculture and afforestation, and also help people to better understand insect-plant co-evolution. Therefore, we are motivated to do two work in this study. The first part is to cluster the insect-plant predation, in such manner, unobserved predation could be estimated. The second part is to explore the connection between predation and bio-taxonomy, and we find insects get more divergence than plants during the insect-plant co-evolution.
△ Less
Submitted 11 December, 2019; v1 submitted 25 November, 2019;
originally announced November 2019.
-
Individuals' Mobility May Promote Criticality in Animal Collective Decision-making
Authors:
Feng Hu,
Zhi Ting Wang,
Fang Yang
Abstract:
It is highly believed that the individuals' mobility plays an important role in phase transition in animal collective motion. Here, we propose a model to study the effects of individuals' mobility in a distributed animal collective decision-making process, during which each individual faces two options with equal quality. We implement the quorum response rule, a type of social interaction rule whi…
▽ More
It is highly believed that the individuals' mobility plays an important role in phase transition in animal collective motion. Here, we propose a model to study the effects of individuals' mobility in a distributed animal collective decision-making process, during which each individual faces two options with equal quality. We implement the quorum response rule, a type of social interaction rule which is taxonomically recognized in animal collective decision-making, as the sole interaction rule. After the introduction of individuals' mobility, we find that the group can reach a consensus decision at one of the options at some critical points even the interaction is local. This result is an obvious contrast to the stationary individuals, the population of which is always equally distributed between the two options with fluctuations. In order to explore the information dynamics, we introduce an important information-theoretic measure, mutual information, to study the critical behaviors. Furthermore, we study the case when individuals interact globally, and also find some qualitative similar critical behaviors.
△ Less
Submitted 22 March, 2018;
originally announced March 2018.
-
Directionality is an inherent property of biochemical networks
Authors:
Feng Yang,
Feng Qi,
Daniel A. Beard
Abstract:
Thermodynamic constraints on reactions directions are inherent in the structure of a given biochemical network. However, concrete procedures for determining feasible reaction directions for large-scale metabolic networks are not well established. This work introduces a systematic approach to compute reaction directions, which are constrained by mass balance and thermodynamics, for genome-scale n…
▽ More
Thermodynamic constraints on reactions directions are inherent in the structure of a given biochemical network. However, concrete procedures for determining feasible reaction directions for large-scale metabolic networks are not well established. This work introduces a systematic approach to compute reaction directions, which are constrained by mass balance and thermodynamics, for genome-scale networks. In addition, it is shown that the nonconvex solution space constrained by physicochemical constraints can be approximated by a set of linearized subspaces in which mass and thermodynamic balance are guaranteed. The developed methodology can be used to {\it ab initio} predict reaction directions of genome-scale networks based solely on the network stoichoimetry.
△ Less
Submitted 1 February, 2007; v1 submitted 2 January, 2007;
originally announced January 2007.