-
Learning to Predict Mutation Effects of Protein-Protein Interactions by Microenvironment-aware Hierarchical Prompt Learning
Authors:
Lirong Wu,
Yijun Tian,
Haitao Lin,
Yufei Huang,
Siyuan Li,
Nitesh V Chawla,
Stan Z. Li
Abstract:
Protein-protein bindings play a key role in a variety of fundamental biological processes, and thus predicting the effects of amino acid mutations on protein-protein binding is crucial. To tackle the scarcity of annotated mutation data, pre-training with massive unlabeled data has emerged as a promising solution. However, this process faces a series of challenges: (1) complex higher-order dependen…
▽ More
Protein-protein bindings play a key role in a variety of fundamental biological processes, and thus predicting the effects of amino acid mutations on protein-protein binding is crucial. To tackle the scarcity of annotated mutation data, pre-training with massive unlabeled data has emerged as a promising solution. However, this process faces a series of challenges: (1) complex higher-order dependencies among multiple (more than paired) structural scales have not yet been fully captured; (2) it is rarely explored how mutations alter the local conformation of the surrounding microenvironment; (3) pre-training is costly, both in data size and computational burden. In this paper, we first construct a hierarchical prompt codebook to record common microenvironmental patterns at different structural scales independently. Then, we develop a novel codebook pre-training task, namely masked microenvironment modeling, to model the joint distribution of each mutation with their residue types, angular statistics, and local conformational changes in the microenvironment. With the constructed prompt codebook, we encode the microenvironment around each mutation into multiple hierarchical prompts and combine them to flexibly provide information to wild-type and mutated protein complexes about their microenvironmental differences. Such a hierarchical prompt learning framework has demonstrated superior performance and training efficiency over state-of-the-art pre-training-based methods in mutation effect prediction and a case study of optimizing human antibodies against SARS-CoV-2.
△ Less
Submitted 15 May, 2024;
originally announced May 2024.
-
ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing
Authors:
Liuzhenghao Lv,
Zongying Lin,
Hao Li,
Yuyang Liu,
Jiaxi Cui,
Calvin Yu-Chian Chen,
Li Yuan,
Yonghong Tian
Abstract:
Large Language Models (LLMs), including GPT-x and LLaMA2, have achieved remarkable performance in multiple Natural Language Processing (NLP) tasks. Under the premise that protein sequences constitute the protein language, Protein Large Language Models (ProLLMs) trained on protein corpora excel at de novo protein sequence generation. However, as of now, unlike LLMs in NLP, no ProLLM is capable of m…
▽ More
Large Language Models (LLMs), including GPT-x and LLaMA2, have achieved remarkable performance in multiple Natural Language Processing (NLP) tasks. Under the premise that protein sequences constitute the protein language, Protein Large Language Models (ProLLMs) trained on protein corpora excel at de novo protein sequence generation. However, as of now, unlike LLMs in NLP, no ProLLM is capable of multiple tasks in the Protein Language Processing (PLP) field. This prompts us to delineate the inherent limitations in current ProLLMs: (i) the lack of natural language capabilities, (ii) insufficient instruction understanding, and (iii) high training resource demands. To address these challenges, we introduce a training framework to transform any general LLM into a ProLLM capable of handling multiple PLP tasks. Specifically, our framework utilizes low-rank adaptation and employs a two-stage training approach, and it is distinguished by its universality, low overhead, and scalability. Through training under this framework, we propose the ProLLaMA model, the first known ProLLM to handle multiple PLP tasks simultaneously. Experiments show that ProLLaMA achieves state-of-the-art results in the unconditional protein sequence generation task. In the controllable protein sequence generation task, ProLLaMA can design novel proteins with desired functionalities. In the protein property prediction task, ProLLaMA achieves nearly 100\% accuracy across many categories. The latter two tasks are beyond the reach of other ProLLMs. Code is available at \url{https://github.com/Lyu6PosHao/ProLLaMA}.
△ Less
Submitted 26 February, 2024;
originally announced February 2024.
-
MAPE-PPI: Towards Effective and Efficient Protein-Protein Interaction Prediction via Microenvironment-Aware Protein Embedding
Authors:
Lirong Wu,
Yijun Tian,
Yufei Huang,
Siyuan Li,
Haitao Lin,
Nitesh V Chawla,
Stan Z. Li
Abstract:
Protein-Protein Interactions (PPIs) are fundamental in various biological processes and play a key role in life activities. The growing demand and cost of experimental PPI assays require computational methods for efficient PPI prediction. While existing methods rely heavily on protein sequence for PPI prediction, it is the protein structure that is the key to determine the interactions. To take bo…
▽ More
Protein-Protein Interactions (PPIs) are fundamental in various biological processes and play a key role in life activities. The growing demand and cost of experimental PPI assays require computational methods for efficient PPI prediction. While existing methods rely heavily on protein sequence for PPI prediction, it is the protein structure that is the key to determine the interactions. To take both protein modalities into account, we define the microenvironment of an amino acid residue by its sequence and structural contexts, which describe the surrounding chemical properties and geometric features. In addition, microenvironments defined in previous work are largely based on experimentally assayed physicochemical properties, for which the "vocabulary" is usually extremely small. This makes it difficult to cover the diversity and complexity of microenvironments. In this paper, we propose Microenvironment-Aware Protein Embedding for PPI prediction (MPAE-PPI), which encodes microenvironments into chemically meaningful discrete codes via a sufficiently large microenvironment "vocabulary" (i.e., codebook). Moreover, we propose a novel pre-training strategy, namely Masked Codebook Modeling (MCM), to capture the dependencies between different microenvironments by randomly masking the codebook and reconstructing the input. With the learned microenvironment codebook, we can reuse it as an off-the-shelf tool to efficiently and effectively encode proteins of different sizes and functions for large-scale PPI prediction. Extensive experiments show that MAPE-PPI can scale to PPI prediction with millions of PPIs with superior trade-offs between effectiveness and computational efficiency than the state-of-the-art competitors.
△ Less
Submitted 22 February, 2024;
originally announced February 2024.
-
A minimal model of cognition based on oscillatory and reinforcement processes
Authors:
Linnéa Gyllingberg,
Yu Tian,
David J. T. Sumpter
Abstract:
Building mathematical models of brains is difficult because of the sheer complexity of the problem. One potential starting point is through basal cognition, which give abstract representation of a range of organisms without central nervous systems, including fungi, slime moulds and bacteria. We propose one such model, demonstrating how a combination of oscillatory and current-based reinforcement p…
▽ More
Building mathematical models of brains is difficult because of the sheer complexity of the problem. One potential starting point is through basal cognition, which give abstract representation of a range of organisms without central nervous systems, including fungi, slime moulds and bacteria. We propose one such model, demonstrating how a combination of oscillatory and current-based reinforcement processes can be used to couple resources in an efficient manner, mimicking the way these organisms function. A key ingredient in our model, not found in previous basal cognition models, is that we explicitly model oscillations in the number of particles (i.e. the nutrients, chemical signals or similar, which make up the biological system) and the flow of these particles within the modelled organisms. Using this approach, we find that our model builds efficient solutions, provided the environmental oscillations are sufficiently out of phase. We further demonstrate that amplitude differences can promote efficient solutions and that the system is robust to frequency differences. In the context of these findings, we discuss connections between our model and basal cognition in biological systems and slime moulds, in particular, how oscillations might contribute to self-organised problem-solving by these organisms.
△ Less
Submitted 13 June, 2024; v1 submitted 4 February, 2024;
originally announced February 2024.
-
Theoretical foundations of studying criticality in the brain
Authors:
Yang Tian,
Zeren Tan,
Hedong Hou,
Guoqi Li,
Aohua Cheng,
Yike Qiu,
Kangyu Weng,
Chun Chen,
Pei Sun
Abstract:
Criticality is hypothesized as a physical mechanism underlying efficient transitions between cortical states and remarkable information processing capacities in the brain. While considerable evidence generally supports this hypothesis, non-negligible controversies persist regarding the ubiquity of criticality in neural dynamics and its role in information processing. Validity issues frequently ari…
▽ More
Criticality is hypothesized as a physical mechanism underlying efficient transitions between cortical states and remarkable information processing capacities in the brain. While considerable evidence generally supports this hypothesis, non-negligible controversies persist regarding the ubiquity of criticality in neural dynamics and its role in information processing. Validity issues frequently arise during identifying potential brain criticality from empirical data. Moreover, the functional benefits implied by brain criticality are frequently misconceived or unduly generalized. These problems stem from the non-triviality and immaturity of the physical theories that analytically derive brain criticality and the statistic techniques that estimate brain criticality from empirical data. To help solve these problems, we present a systematic review and reformulate the foundations of studying brain criticality, i.e., ordinary criticality (OC), quasi-criticality (qC), self-organized criticality (SOC), and self-organized quasi-criticality (SOqC), using the terminology of neuroscience. We offer accessible explanations of the physical theories and statistic techniques of brain criticality, providing step-by-step derivations to characterize neural dynamics as a physical system with avalanches. We summarize error-prone details and existing limitations in brain criticality analysis and suggest possible solutions. Moreover, we present a forward-looking perspective on how optimizing the foundations of studying brain criticality can deepen our understanding of various neuroscience questions.
△ Less
Submitted 8 June, 2023;
originally announced June 2023.
-
Deep recurrent spiking neural networks capture both static and dynamic representations of the visual cortex under movie stimuli
Authors:
Liwei Huang,
ZhengYu Ma,
Huihui Zhou,
Yonghong Tian
Abstract:
In the real world, visual stimuli received by the biological visual system are predominantly dynamic rather than static. A better understanding of how the visual cortex represents movie stimuli could provide deeper insight into the information processing mechanisms of the visual system. Although some progress has been made in modeling neural responses to natural movies with deep neural networks, t…
▽ More
In the real world, visual stimuli received by the biological visual system are predominantly dynamic rather than static. A better understanding of how the visual cortex represents movie stimuli could provide deeper insight into the information processing mechanisms of the visual system. Although some progress has been made in modeling neural responses to natural movies with deep neural networks, the visual representations of static and dynamic information under such time-series visual stimuli remain to be further explored. In this work, considering abundant recurrent connections in the mouse visual system, we design a recurrent module based on the hierarchy of the mouse cortex and add it into Deep Spiking Neural Networks, which have been demonstrated to be a more compelling computational model for the visual cortex. Using Time-Series Representational Similarity Analysis, we measure the representational similarity between networks and mouse cortical regions under natural movie stimuli. Subsequently, we conduct a comparison of the representational similarity across recurrent/feedforward networks and image/video training tasks. Trained on the video action recognition task, recurrent SNN achieves the highest representational similarity and significantly outperforms feedforward SNN trained on the same task by 15% and the recurrent SNN trained on the image classification task by 8%. We investigate how static and dynamic representations of SNNs influence the similarity, as a way to explain the importance of these two forms of representations in biological neural coding. Taken together, our work is the first to apply deep recurrent SNNs to model the mouse visual cortex under movie stimuli and we establish that these networks are competent to capture both static and dynamic representations and make contributions to understanding the movie information processing mechanisms of the visual cortex.
△ Less
Submitted 2 June, 2023;
originally announced June 2023.
-
ProGroTrack: Deep Learning-Assisted Tracking of Intracellular Protein Growth Dynamics
Authors:
Kai San Chan,
Huimiao Chen,
Chenyu Jin,
Yuxuan Tian,
Dingchang Lin
Abstract:
Accurate tracking of cellular and subcellular structures, along with their dynamics, plays a pivotal role in understanding the underlying mechanisms of biological systems. This paper presents a novel approach, ProGroTrack, that combines the You Only Look Once (YOLO) and ByteTrack algorithms within the detection-based tracking (DBT) framework to track intracellular protein nanostructures. Focusing…
▽ More
Accurate tracking of cellular and subcellular structures, along with their dynamics, plays a pivotal role in understanding the underlying mechanisms of biological systems. This paper presents a novel approach, ProGroTrack, that combines the You Only Look Once (YOLO) and ByteTrack algorithms within the detection-based tracking (DBT) framework to track intracellular protein nanostructures. Focusing on iPAK4 protein fibers as a representative case study, we conducted a comprehensive evaluation of YOLOv5 and YOLOv8 models, revealing the superior performance of YOLOv5 on our dataset. Notably, YOLOv5x achieved an impressive mAP50 of 0.839 and F-score of 0.819. To further optimize detection capabilities, we incorporated semi-supervised learning for model improvement, resulting in enhanced performances in all metrics. Subsequently, we successfully applied our approach to track the growth behavior of iPAK4 protein fibers, revealing their two distinct growth phases consistent with a previously reported kinetic model. This research showcases the promising potential of our approach, extending beyond iPAK4 fibers. It also offers a significant advancement in precise tracking of dynamic processes in live cells, and fostering new avenues for biomedical research.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Deep Spiking Neural Networks with High Representation Similarity Model Visual Pathways of Macaque and Mouse
Authors:
Liwei Huang,
Zhengyu Ma,
Liutao Yu,
Huihui Zhou,
Yonghong Tian
Abstract:
Deep artificial neural networks (ANNs) play a major role in modeling the visual pathways of primate and rodent. However, they highly simplify the computational properties of neurons compared to their biological counterparts. Instead, Spiking Neural Networks (SNNs) are more biologically plausible models since spiking neurons encode information with time sequences of spikes, just like biological neu…
▽ More
Deep artificial neural networks (ANNs) play a major role in modeling the visual pathways of primate and rodent. However, they highly simplify the computational properties of neurons compared to their biological counterparts. Instead, Spiking Neural Networks (SNNs) are more biologically plausible models since spiking neurons encode information with time sequences of spikes, just like biological neurons do. However, there is a lack of studies on visual pathways with deep SNNs models. In this study, we model the visual cortex with deep SNNs for the first time, and also with a wide range of state-of-the-art deep CNNs and ViTs for comparison. Using three similarity metrics, we conduct neural representation similarity experiments on three neural datasets collected from two species under three types of stimuli. Based on extensive similarity analyses, we further investigate the functional hierarchy and mechanisms across species. Almost all similarity scores of SNNs are higher than their counterparts of CNNs with an average of 6.6%. Depths of the layers with the highest similarity scores exhibit little differences across mouse cortical regions, but vary significantly across macaque regions, suggesting that the visual processing structure of mice is more regionally homogeneous than that of macaques. Besides, the multi-branch structures observed in some top mouse brain-like neural networks provide computational evidence of parallel processing streams in mice, and the different performance in fitting macaque neural representations under different stimuli exhibits the functional specialization of information processing in macaques. Taken together, our study demonstrates that SNNs could serve as promising candidates to better model and explain the functional hierarchy and mechanisms of the visual system.
△ Less
Submitted 22 May, 2023; v1 submitted 9 March, 2023;
originally announced March 2023.
-
Bridging the information and dynamics attributes of neural activities
Authors:
Yang Tian,
Guoqi Li,
Pei Sun
Abstract:
The brain works as a dynamic system to process information. Various challenges remain in understanding the connection between information and dynamics attributes in the brain. The present research pursues exploring how the characteristics of neural information functions are linked to neural dynamics. We attempt to bridge dynamics (e.g., Kolmogorov-Sinai entropy) and information (e.g., mutual infor…
▽ More
The brain works as a dynamic system to process information. Various challenges remain in understanding the connection between information and dynamics attributes in the brain. The present research pursues exploring how the characteristics of neural information functions are linked to neural dynamics. We attempt to bridge dynamics (e.g., Kolmogorov-Sinai entropy) and information (e.g., mutual information and Fisher information) metrics on the stimulus-triggered stochastic dynamics in neural populations. On the one hand, our unified analysis identifies various essential features of the information-processing-related neural dynamics. We discover spatiotemporal differences in the dynamic randomness and chaotic degrees of neural dynamics during neural information processing. On the other hand, our framework reveals the fundamental role of neural dynamics in shaping neural information processing. The neural dynamics creates an oppositely directed variation of encoding and decoding properties under specific conditions, and it determines the neural representation of stimulus distribution. Overall, our findings demonstrate a potential direction to explain the emergence of neural information processing from neural dynamics and help understand the intrinsic connections between the informational and the physical brain.
△ Less
Submitted 13 July, 2022;
originally announced July 2022.
-
Graph-based Molecular Representation Learning
Authors:
Zhichun Guo,
Kehan Guo,
Bozhao Nan,
Yijun Tian,
Roshni G. Iyer,
Yihong Ma,
Olaf Wiest,
Xiangliang Zhang,
Wei Wang,
Chuxu Zhang,
Nitesh V. Chawla
Abstract:
Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep…
▽ More
Molecular representation learning (MRL) is a key step to build the connection between machine learning and chemical science. In particular, it encodes molecules as numerical vectors preserving the molecular structures and features, on top of which the downstream tasks (e.g., property prediction) can be performed. Recently, MRL has achieved considerable progress, especially in methods based on deep molecular graph learning. In this survey, we systematically review these graph-based molecular representation techniques, especially the methods incorporating chemical domain knowledge. Specifically, we first introduce the features of 2D and 3D molecular graphs. Then we summarize and categorize MRL methods into three groups based on their input. Furthermore, we discuss some typical chemical applications supported by MRL. To facilitate studies in this fast-developing area, we also list the benchmarks and commonly used datasets in the paper. Finally, we share our thoughts on future research directions.
△ Less
Submitted 28 November, 2023; v1 submitted 8 July, 2022;
originally announced July 2022.
-
Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning
Authors:
Junhao Zhang,
Vishwanatha M. Rao,
Ye Tian,
Yanting Yang,
Nicolas Acosta,
Zihan Wan,
Pin-Yu Lee,
Chloe Zhang,
Lawrence S. Kegeles,
Scott A. Small,
Jia Guo
Abstract:
Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we e…
▽ More
Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we extracted the 3D whole-brain structure using standard post-processing methods. A deep learning model was then developed, optimized, and evaluated on three open datasets with T1-weighted MRI scans of patients with schizophrenia. Our proposed model outperformed the benchmark model, which was also trained with structural MR images using a 3D CNN architecture. Our model is capable of almost perfectly (area under the ROC curve = 0.987) distinguishing schizophrenia patients from healthy controls on unseen structural MRI scans. Regional analysis localized subcortical regions and ventricles as the most predictive brain regions. Subcortical structures serve a pivotal role in cognitive, affective, and social functions in humans, and structural abnormalities of these regions have been associated with schizophrenia. Our finding corroborates that schizophrenia is associated with widespread alterations in subcortical brain structure and the subcortical structural information provides prominent features in diagnostic classification. Together, these results further demonstrate the potential of deep learning to improve schizophrenia diagnosis and identify its structural neuroimaging signatures from a single, standard T1-weighted brain MRI.
△ Less
Submitted 7 July, 2022; v1 submitted 26 June, 2022;
originally announced June 2022.
-
Self-organized critical dynamics of RNA virus evolution
Authors:
Xiaofei Ge,
Kaichao You,
Zeren Tan,
Hedong Hou,
Yang Tian,
Pei Sun
Abstract:
RNA virus (e.g., SARS-CoV-2) evolves in a complex manner. Studying RNA virus evolution is vital for understanding molecular evolution and medicine development. Scientists lack, however, general frameworks to characterize the dynamics of RNA virus evolution directly from empirical data and identify potential physical laws. To fill this gap, we present a theory to characterize the RNA virus evolutio…
▽ More
RNA virus (e.g., SARS-CoV-2) evolves in a complex manner. Studying RNA virus evolution is vital for understanding molecular evolution and medicine development. Scientists lack, however, general frameworks to characterize the dynamics of RNA virus evolution directly from empirical data and identify potential physical laws. To fill this gap, we present a theory to characterize the RNA virus evolution as a physical system with absorbing states and avalanche behaviors. This approach maps accessible biological data (e.g., phylogenetic tree and infection) to a general stochastic process of RNA virus infection and evolution, enabling researchers to verify potential self-organized criticality underlying RNA virus evolution. We apply our framework to SARS-CoV-2, the virus accounting for the global epidemic of COVID-19. We find that SARS-CoV-2 exhibits scale-invariant avalanches as mean-field theory predictions. The observed scaling relation, universal collapse, and slowly decaying auto-correlation suggest a self-organized critical dynamics of SARS-CoV-2 evolution. Interestingly, the lineages that emerge from critical evolution processes coincidentally match with threatening lineages of SARS-CoV-2 (e.g., the Delta virus). We anticipate our approach to be a general formalism to portray RNA virus evolution and help identify potential virus lineages to be concerned.
△ Less
Submitted 18 April, 2022;
originally announced April 2022.
-
Thermodynamics of Encoding and Encoders
Authors:
Yang Tian,
Pei Sun
Abstract:
Non-isolated systems have diverse coupling relations with the external environment. These relations generate complex thermodynamics and information transmission between the system and its environment. The framework depicted in the current research attempts to glance at the critical role of the internal orders inside the non-isolated system in shaping the information thermodynamics coupling. We cha…
▽ More
Non-isolated systems have diverse coupling relations with the external environment. These relations generate complex thermodynamics and information transmission between the system and its environment. The framework depicted in the current research attempts to glance at the critical role of the internal orders inside the non-isolated system in shaping the information thermodynamics coupling. We characterize the coupling as a generalized encoding process, where the system acts as an information thermodynamics encoder to encode the external information based on thermodynamics. We formalize the encoding process in the context of the nonequilibrium second law of thermodynamics, revealing an intrinsic difference in information thermodynamics characteristics between information thermodynamics encoders with and without internal correlations. During the information encoding process of an external source $\mathsf{Y}$, specific sub-systems in an encoder $\mathsf{X}$ with internal correlations can exceed the information thermodynamics bound on $\left(\mathsf{X},\mathsf{Y}\right)$ and encode more information than system $\mathsf{X}$ works as a whole. We computationally verify this theoretical finding in an Ising model with a random external field and a neural data set of the human brain during visual perception and recognition. Our analysis demonstrates that the stronger internal correlation inside these systems implies a higher possibility for specific sub-systems to encode more information than the global one. These findings may suggest a new perspective in studying information thermodynamics in diverse physical and biological systems.
△ Less
Submitted 12 November, 2021;
originally announced November 2021.
-
Information Evolution in Complex Networks
Authors:
Yang Tian,
Guoqi Li,
Pei Sun
Abstract:
Many biological phenomena or social events critically depend on how information evolves in complex networks. However, a general theory to characterize information evolution is yet absent. Consequently, numerous unknowns remain about the mechanisms underlying information evolution. Among these unknowns, a fundamental problem, being a seeming paradox, lies in the coexistence of local randomness, man…
▽ More
Many biological phenomena or social events critically depend on how information evolves in complex networks. However, a general theory to characterize information evolution is yet absent. Consequently, numerous unknowns remain about the mechanisms underlying information evolution. Among these unknowns, a fundamental problem, being a seeming paradox, lies in the coexistence of local randomness, manifested as the stochastic distortion of information content during individual-individual diffusion, and global regularity, illustrated by specific non-random patterns of information content on the network scale. Here, we attempt to formalize information evolution and explain the coexistence of randomness and regularity in complex networks. Applying network dynamics and information theory, we discover that a certain amount of information, determined by the selectivity of networks to the input information, frequently survives from random distortion. Other information will inevitably experience distortion or dissipation, whose speeds are shaped by the diversity of information selectivity in networks. The discovered laws exist irrespective of noise, but the noise accounts for the intensification. We further demonstrate the ubiquity of our discovered laws by analyzing the emergence of neural tuning properties in the primary visual and medial temporal cortices of animal brains and the emergence of extreme opinions in social networks.
△ Less
Submitted 18 April, 2022; v1 submitted 12 November, 2021;
originally announced November 2021.
-
TDACNN: Target-domain-free Domain Adaptation Convolutional Neural Network for Drift Compensation in Gas Sensors
Authors:
Yuelin Zhang,
Sihao Xiang,
Zehuan Wang,
Xiaoyan Peng,
Yutong Tian,
Shukai Duan,
Jia Yan
Abstract:
Sensor drift is a long-existing unpredictable problem that deteriorates the performance of gaseous substance recognition, calling for an antidrift domain adaptation algorithm. However, the prerequisite for traditional methods to achieve fine results is to have data from both nondrift distributions (source domain) and drift distributions (target domain) for domain alignment, which is usually unreal…
▽ More
Sensor drift is a long-existing unpredictable problem that deteriorates the performance of gaseous substance recognition, calling for an antidrift domain adaptation algorithm. However, the prerequisite for traditional methods to achieve fine results is to have data from both nondrift distributions (source domain) and drift distributions (target domain) for domain alignment, which is usually unrealistic and unachievable in real-life scenarios. To compensate for this, in this paper, deep learning based on a target-domain-free domain adaptation convolutional neural network (TDACNN) is proposed. The main concept is that CNNs extract not only the domain-specific features of samples but also the domain-invariant features underlying both the source and target domains. Making full use of these various levels of embedding features can lead to comprehensive utilization of different levels of characteristics, thus achieving drift compensation by the extracted intermediate features between two domains. In the TDACNN, a flexible multibranch backbone with a multiclassifier structure is proposed under the guidance of bionics, which utilizes multiple embedding features comprehensively without involving target domain data during training. A classifier ensemble method based on maximum mean discrepancy (MMD) is proposed to evaluate all the classifiers jointly based on the credibility of the pseudolabel. To optimize network training, an additive angular margin softmax loss with parameter dynamic adjustment is utilized. Experiments on two drift datasets under different settings demonstrate the superiority of TDACNN compared with several state-of-the-art methods.
△ Less
Submitted 26 March, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
All-Fibre Label-Free Nano-Sensor for Real-Time in situ Early Monitoring of Cellular Apoptosis
Authors:
Danran Li,
Nina Wang,
Tianyang Zhang,
Guangxing Wu,
Yifeng Xiong,
Qianqian Du,
Yunfei Tian,
Wei-wei Zhao,
Jiandong Ye,
Shulin Gu,
Yanqing Lu,
Dechen Jiang,
Fei Xu
Abstract:
The achievement of all-fibre functional nano-modules for subcellular label-free measurement has long been pursued due to the limitations of manufacturing techniques. In this paper, a compact all-fibre label-free nano-sensor composed of a fibre taper and zinc oxide nano-gratings is designed and applied for the early monitoring of apoptosis in single living cells. Because of its nanoscale dimensions…
▽ More
The achievement of all-fibre functional nano-modules for subcellular label-free measurement has long been pursued due to the limitations of manufacturing techniques. In this paper, a compact all-fibre label-free nano-sensor composed of a fibre taper and zinc oxide nano-gratings is designed and applied for the early monitoring of apoptosis in single living cells. Because of its nanoscale dimensions, mechanical flexibility and minimal cytotoxicity to cells, the sensing module can be loaded in cells for long-term in situ tracking with high sensitivity. A gradual increase in the nuclear refractive index during the apoptosis process is observed, revealing the increase in molecular density and the decrease in cell volume. The strategy used in this study not only contributes to the understanding of internal environmental variations during cellular apoptosis but also provides a new platform for non-fluorescent all-fibre devices to investigate cellular events and to promote new progress in fundamental cell biochemical engineering.
△ Less
Submitted 29 May, 2021;
originally announced May 2021.
-
Learning abstract structure for drawing by efficient motor program induction
Authors:
Lucas Y. Tian,
Kevin Ellis,
Marta Kryven,
Joshua B. Tenenbaum
Abstract:
Humans flexibly solve new problems that differ qualitatively from those they were trained on. This ability to generalize is supported by learned concepts that capture structure common across different problems. Here we develop a naturalistic drawing task to study how humans rapidly acquire structured prior knowledge. The task requires drawing visual objects that share underlying structure, based o…
▽ More
Humans flexibly solve new problems that differ qualitatively from those they were trained on. This ability to generalize is supported by learned concepts that capture structure common across different problems. Here we develop a naturalistic drawing task to study how humans rapidly acquire structured prior knowledge. The task requires drawing visual objects that share underlying structure, based on a set of composable geometric rules. We show that people spontaneously learn abstract drawing procedures that support generalization, and propose a model of how learners can discover these reusable drawing programs. Trained in the same setting as humans, and constrained to produce efficient motor actions, this model discovers new drawing routines that transfer to test objects and resemble learned features of human sequences. These results suggest that two principles guiding motor program induction in the model - abstraction (general programs that ignore object-specific details) and compositionality (recombining previously learned programs) - are key for explaining how humans learn structured internal representations that guide flexible reasoning and learning.
△ Less
Submitted 8 August, 2020;
originally announced August 2020.
-
Taking the pulse of COVID-19: A spatiotemporal perspective
Authors:
Chaowei Yang,
Dexuan Sha,
Qian Liu,
Yun Li,
Hai Lan,
Weihe Wendy Guan,
Tao Hu,
Zhenlong Li,
Zhiran Zhang,
John Hoot Thompson,
Zifu Wang,
David Wong,
Shiyang Ruan,
Manzhu Yu,
Douglas Richardson,
Luyao Zhang,
Ruizhi Hou,
You Zhou,
Cheng Zhong,
Yifei Tian,
Fayez Beaini,
Kyla Carte,
Colin Flynn,
Wei Liu,
Dieter Pfoser
, et al. (10 additional authors not shown)
Abstract:
The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the…
▽ More
The sudden outbreak of the Coronavirus disease (COVID-19) swept across the world in early 2020, triggering the lockdowns of several billion people across many countries, including China, Spain, India, the U.K., Italy, France, Germany, and most states of the U.S. The transmission of the virus accelerated rapidly with the most confirmed cases in the U.S., and New York City became an epicenter of the pandemic by the end of March. In response to this national and global emergency, the NSF Spatiotemporal Innovation Center brought together a taskforce of international researchers and assembled implemented strategies to rapidly respond to this crisis, for supporting research, saving lives, and protecting the health of global citizens. This perspective paper presents our collective view on the global health emergency and our effort in collecting, analyzing, and sharing relevant data on global policy and government responses, geospatial indicators of the outbreak and evolving forecasts; in developing research capabilities and mitigation measures with global scientists, promoting collaborative research on outbreak dynamics, and reflecting on the dynamic responses from human societies.
△ Less
Submitted 8 May, 2020;
originally announced May 2020.
-
Towards the Next Generation of Retinal Neuroprosthesis: Visual Computation with Spikes
Authors:
Zhaofei Yu,
Jian K. Liu,
Shanshan Jia,
Yichen Zhang,
Yajing Zheng,
Yonghong Tian,
Tiejun Huang
Abstract:
Neuroprosthesis, as one type of precision medicine device, is aiming for manipulating neuronal signals of the brain in a closed-loop fashion, together with receiving stimulus from the environment and controlling some part of our brain/body. In terms of vision, incoming information can be processed by the brain in millisecond interval. The retina computes visual scenes and then sends its output as…
▽ More
Neuroprosthesis, as one type of precision medicine device, is aiming for manipulating neuronal signals of the brain in a closed-loop fashion, together with receiving stimulus from the environment and controlling some part of our brain/body. In terms of vision, incoming information can be processed by the brain in millisecond interval. The retina computes visual scenes and then sends its output as neuronal spikes to the cortex for further computation. Therefore, the neuronal signal of interest for retinal neuroprosthesis is spike. Closed-loop computation in neuroprosthesis includes two stages: encoding stimulus to neuronal signal, and decoding it into stimulus. Here we review some of the recent progress about visual computation models that use spikes for analyzing natural scenes, including static images and dynamic movies. We hypothesize that for a better understanding of computational principles in the retina, one needs a hypercircuit view of the retina, in which different functional network motifs revealed in the cortex neuronal network should be taken into consideration for the retina. Different building blocks of the retina, including a diversity of cell types and synaptic connections, either chemical synapses or electrical synapses (gap junctions), make the retina an ideal neuronal network to adapt the computational techniques developed in artificial intelligence for modeling of encoding/decoding visual scenes. Altogether, one needs a systems approach of visual computation with spikes to advance the next generation of retinal neuroprosthesis as an artificial visual system.
△ Less
Submitted 13 January, 2020;
originally announced January 2020.
-
Reconstruction of Natural Visual Scenes from Neural Spikes with Deep Neural Networks
Authors:
Yichen Zhang,
Shanshan Jia,
Yajing Zheng,
Zhaofei Yu,
Yonghong Tian,
Siwei Ma,
Tiejun Huang,
Jian K. Liu
Abstract:
Neural coding is one of the central questions in systems neuroscience for understanding how the brain processes stimulus from the environment, moreover, it is also a cornerstone for designing algorithms of brain-machine interface, where decoding incoming stimulus is highly demanded for better performance of physical devices. Traditionally researchers have focused on functional magnetic resonance i…
▽ More
Neural coding is one of the central questions in systems neuroscience for understanding how the brain processes stimulus from the environment, moreover, it is also a cornerstone for designing algorithms of brain-machine interface, where decoding incoming stimulus is highly demanded for better performance of physical devices. Traditionally researchers have focused on functional magnetic resonance imaging (fMRI) data as the neural signals of interest for decoding visual scenes. However, our visual perception operates in a fast time scale of millisecond in terms of an event termed neural spike. There are few studies of decoding by using spikes. Here we fulfill this aim by developing a novel decoding framework based on deep neural networks, named spike-image decoder (SID), for reconstructing natural visual scenes, including static images and dynamic videos, from experimentally recorded spikes of a population of retinal ganglion cells. The SID is an end-to-end decoder with one end as neural spikes and the other end as images, which can be trained directly such that visual scenes are reconstructed from spikes in a highly accurate fashion. Our SID also outperforms on the reconstruction of visual stimulus compared to existing fMRI decoding models. In addition, with the aid of a spike encoder, we show that SID can be generalized to arbitrary visual scenes by using the image datasets of MNIST, CIFAR10, and CIFAR100. Furthermore, with a pre-trained SID, one can decode any dynamic videos to achieve real-time encoding and decoding of visual scenes by spikes. Altogether, our results shed new light on neuromorphic computing for artificial visual systems, such as event-based visual cameras and visual neuroprostheses.
△ Less
Submitted 28 January, 2020; v1 submitted 29 April, 2019;
originally announced April 2019.
-
Probabilistic Inference of Binary Markov Random Fields in Spiking Neural Networks through Mean-field Approximation
Authors:
Yajing Zheng,
Shanshan Jia,
Zhaofei Yu,
Tiejun Huang,
Jian K. Liu,
Yonghong Tian
Abstract:
Recent studies have suggested that the cognitive process of the human brain is realized as probabilistic inference and can be further modeled by probabilistic graphical models like Markov random fields. Nevertheless, it remains unclear how probabilistic inference can be implemented by a network of spiking neurons in the brain. Previous studies have tried to relate the inference equation of binary…
▽ More
Recent studies have suggested that the cognitive process of the human brain is realized as probabilistic inference and can be further modeled by probabilistic graphical models like Markov random fields. Nevertheless, it remains unclear how probabilistic inference can be implemented by a network of spiking neurons in the brain. Previous studies have tried to relate the inference equation of binary Markov random fields to the dynamic equation of spiking neural networks through belief propagation algorithm and reparameterization, but they are valid only for Markov random fields with limited network structure. In this paper, we propose a spiking neural network model that can implement inference of arbitrary binary Markov random fields. Specifically, we design a spiking recurrent neural network and prove that its neuronal dynamics are mathematically equivalent to the inference process of Markov random fields by adopting mean-field theory. Furthermore, our mean-field approach unifies previous works. Theoretical analysis and experimental results, together with the application to image denoising, demonstrate that our proposed spiking neural network can get comparable results to that of mean-field inference.
△ Less
Submitted 12 March, 2020; v1 submitted 22 February, 2019;
originally announced February 2019.
-
Revealing Fine Structures of the Retinal Receptive Field by Deep Learning Networks
Authors:
Qi Yan,
Yajing Zheng,
Shanshan Jia,
Yichen Zhang,
Zhaofei Yu,
Feng Chen,
Yonghong Tian,
Tiejun Huang,
Jian K. Liu
Abstract:
Deep convolutional neural networks (CNNs) have demonstrated impressive performance on many visual tasks. Recently, they became useful models for the visual system in neuroscience. However, it is still not clear what are learned by CNNs in terms of neuronal circuits. When a deep CNN with many layers is used for the visual system, it is not easy to compare the structure components of CNNs with possi…
▽ More
Deep convolutional neural networks (CNNs) have demonstrated impressive performance on many visual tasks. Recently, they became useful models for the visual system in neuroscience. However, it is still not clear what are learned by CNNs in terms of neuronal circuits. When a deep CNN with many layers is used for the visual system, it is not easy to compare the structure components of CNNs with possible neuroscience underpinnings due to highly complex circuits from the retina to higher visual cortex. Here we address this issue by focusing on single retinal ganglion cells with biophysical models and recording data from animals. By training CNNs with white noise images to predict neuronal responses, we found that fine structures of the retinal receptive field can be revealed. Specifically, convolutional filters learned are resembling biological components of the retinal circuit. This suggests that a CNN learning from one single retinal cell reveals a minimal neural network carried out in this cell. Furthermore, when CNNs learned from different cells are transferred between cells, there is a diversity of transfer learning performance, which indicates that CNNs are cell-specific. Moreover, when CNNs are transferred between different types of input images, here white noise v.s. natural images, transfer learning shows a good performance, which implies that CNNs indeed capture the full computational ability of a single retinal cell for different inputs. Taken together, these results suggest that CNNs could be used to reveal structure components of neuronal circuits, and provide a powerful model for neural system identification.
△ Less
Submitted 18 February, 2020; v1 submitted 6 November, 2018;
originally announced November 2018.
-
Neural System Identification with Spike-triggered Non-negative Matrix Factorization
Authors:
Shanshan Jia,
Zhaofei Yu,
Arno Onken,
Yonghong Tian,
Tiejun Huang,
Jian K. Liu
Abstract:
Neuronal circuits formed in the brain are complex with intricate connection patterns. Such complexity is also observed in the retina as a relatively simple neuronal circuit. A retinal ganglion cell receives excitatory inputs from neurons in previous layers as driving forces to fire spikes. Analytical methods are required that can decipher these components in a systematic manner. Recently a method…
▽ More
Neuronal circuits formed in the brain are complex with intricate connection patterns. Such complexity is also observed in the retina as a relatively simple neuronal circuit. A retinal ganglion cell receives excitatory inputs from neurons in previous layers as driving forces to fire spikes. Analytical methods are required that can decipher these components in a systematic manner. Recently a method termed spike-triggered non-negative matrix factorization (STNMF) has been proposed for this purpose. In this study, we extend the scope of the STNMF method. By using the retinal ganglion cell as a model system, we show that STNMF can detect various computational properties of upstream bipolar cells, including spatial receptive field, temporal filter, and transfer nonlinearity. In addition, we recover synaptic connection strengths from the weight matrix of STNMF. Furthermore, we show that STNMF can separate spikes of a ganglion cell into a few subsets of spikes where each subset is contributed by one presynaptic bipolar cell. Taken together, these results corroborate that STNMF is a useful method for deciphering the structure of neuronal circuits.
△ Less
Submitted 1 March, 2020; v1 submitted 12 August, 2018;
originally announced August 2018.
-
Winner-Take-All as Basic Probabilistic Inference Unit of Neuronal Circuits
Authors:
Zhaofei Yu,
Yonghong Tian,
Tiejun Huang,
Jian K. Liu
Abstract:
Experimental observations of neuroscience suggest that the brain is working a probabilistic way when computing information with uncertainty. This processing could be modeled as Bayesian inference. However, it remains unclear how Bayesian inference could be implemented at the level of neuronal circuits of the brain. In this study, we propose a novel general-purpose neural implementation of probabil…
▽ More
Experimental observations of neuroscience suggest that the brain is working a probabilistic way when computing information with uncertainty. This processing could be modeled as Bayesian inference. However, it remains unclear how Bayesian inference could be implemented at the level of neuronal circuits of the brain. In this study, we propose a novel general-purpose neural implementation of probabilistic inference based on a ubiquitous network of cortical microcircuits, termed winner-take-all (WTA) circuit. We show that each WTA circuit could encode the distribution of states defined on a variable. By connecting multiple WTA circuits together, the joint distribution can be represented for arbitrary probabilistic graphical models. Moreover, we prove that the neural dynamics of WTA circuit is able to implement one of the most powerful inference methods in probabilistic graphical models, mean-field inference. We show that the synaptic drive of each spiking neuron in the WTA circuit encodes the marginal probability of the variable in each state, and the firing probability (or firing rate) of each neuron is proportional to the marginal probability. Theoretical analysis and experimental results demonstrate that the WTA circuits can get comparable inference result as mean-field approximation. Taken together, our results suggest that the WTA circuit could be seen as the minimal inference unit of neuronal circuits.
△ Less
Submitted 2 August, 2018;
originally announced August 2018.
-
A simple blind-denoising filter inspired by electrically coupled photoreceptors in the retina
Authors:
Yang Yue,
Liuyuan He,
Gan He,
Jian. K. Liu,
Kai Du,
Yonghong Tian,
Tiejun Huang
Abstract:
Photoreceptors in the retina are coupled by electrical synapses called "gap junctions". It has long been established that gap junctions increase the signal-to-noise ratio of photoreceptors. Inspired by electrically coupled photoreceptors, we introduced a simple filter, the PR-filter, with only one variable. On BSD68 dataset, PR-filter showed outstanding performance in SSIM during blind denoising t…
▽ More
Photoreceptors in the retina are coupled by electrical synapses called "gap junctions". It has long been established that gap junctions increase the signal-to-noise ratio of photoreceptors. Inspired by electrically coupled photoreceptors, we introduced a simple filter, the PR-filter, with only one variable. On BSD68 dataset, PR-filter showed outstanding performance in SSIM during blind denoising tasks. It also significantly improved the performance of state-of-the-art convolutional neural network blind denosing on non-Gaussian noise. The performance of keeping more details might be attributed to small receptive field of the photoreceptors.
△ Less
Submitted 27 August, 2018; v1 submitted 15 June, 2018;
originally announced June 2018.
-
Impact of Land Use on the DOM Composition in Different Seasons in a Subtropical River Flowing through the Region Undergoing Rapid Urbanization
Authors:
Qi Liu,
Yuan Jiang,
Zhaojiang Hou,
Yulu Tian,
Kejian He,
Lan Fu,
Hui Xu
Abstract:
The dissolved organic matter (DOM) composition in river ecosystems could reflect the human impacts on the river ecosystem, and plays an important role in the carbon cycling process. We collected water and phytoplankton samples at 107 sites in the Dongjiang River in two seasons to assess the impact of the sub-catchments land use structure on the DOM composition. The results showed that (1) the fore…
▽ More
The dissolved organic matter (DOM) composition in river ecosystems could reflect the human impacts on the river ecosystem, and plays an important role in the carbon cycling process. We collected water and phytoplankton samples at 107 sites in the Dongjiang River in two seasons to assess the impact of the sub-catchments land use structure on the DOM composition. The results showed that (1) the forested sub-catchments had higher humic-like C1 (16.45%) and C2 (25.04%) and lower protein-like C3 (22.57%) and C4 (35.95%) than urbanized and mixed forest-agriculture sub-catchments, while the urbanized sub-catchments showed an inverse trend (4.54%, 15.51%, 33.97% and 45.98%, respectively). (2) The significant variation in the proportion of C1 and C4 between the dry and rainy seasons was recorded in both the forested and the mixed forest-agriculture sub-catchments (p<0.01), but only C4 showed an obvious seasonal variation in the urbanized sub-catchments (p<0.01). While the DOM composition was mainly related to the proportion of urbanized land and forested land year-round (p<0.01), it had stronger correlation with forested land in the dry season and urbanized land in the rainy season. (3) No significant correlation between the DOM composition and the agricultural land proportion was found in either season (p>0.05). Our findings indicated that the DOM composition was strongly dependent on the land use structure of the sub-catchments and varied seasonally, but the seasonal variation pattern could be disturbed by human activities in the extensively urbanized catchments. Notably, the higher C4 proportion compared with those in temperate rivers implied a larger amount of CO2 released from the subtropical rivers into the atmosphere when considering bioavailability.
△ Less
Submitted 7 February, 2018;
originally announced February 2018.
-
A Statistical Approach to Identifying Significant Transgenerational Methylation Changes
Authors:
Ye Tian,
Yi Fu,
Guoqiang Yu,
Bai Zhang,
Yue Wang
Abstract:
Epigenetic aberrations have profound effects on phenotypic output. Genome wide methylation alterations are inheritable to pass down the aberrations through multiple generations. We developed a statistical method, Genome-wide Identification of Significant Methylation Alteration, GISAIM, to study the significant transgenerational methylation changes. GISAIM finds the significant methylation aberrati…
▽ More
Epigenetic aberrations have profound effects on phenotypic output. Genome wide methylation alterations are inheritable to pass down the aberrations through multiple generations. We developed a statistical method, Genome-wide Identification of Significant Methylation Alteration, GISAIM, to study the significant transgenerational methylation changes. GISAIM finds the significant methylation aberrations that are inherited through multiple generations. In a concrete biological study, we investigated whether exposing pregnant rats (F0) to a high fat (HF) diet throughout pregnancy or ethinyl estradiol (EE2)-supplemented diet during gestation days 14 20 affects carcinogen-induced mammary cancer risk in daughters (F1), granddaughters (F2) and great-granddaughters (F3). Mammary tumorigenesis was higher in daughters and granddaughters of HF rat dams, and in daughters, granddaughters and great-granddaughters of EE2 rat dams. Outcross experiments showed that increased mammary cancer risk was transmitted to HF granddaughters equally through the female or male germlines, but is only transmitted to EE2 granddaughters through the female germline. Transgenerational effect on mammary cancer risk was associated with increased expression of DNA methyltransferases, and across all three EE2 generations hypo or hyper methylation of the same 375 gene promoter regions in their mammary glands. Our study shows that maternal dietary estrogenic exposures during pregnancy can increase breast cancer risk in multiple generations of offspring, and the increase in risk may be inherited through non-genetic means, possibly involving DNA methylation.
△ Less
Submitted 27 September, 2014;
originally announced September 2014.
-
Impact of delay on HIV-1 dynamics of fighting a virus with another virus
Authors:
Yun Tian,
Yu Bai,
Pei Yu
Abstract:
In this paper, we propose a mathematical model for HIV-1 infection with intracellular delay. The model examines a viral-therapy for controlling infections through recombining HIV-1 virus with a genetically modified virus. For this model, the basic reproduction number $\mathcal{R}_0$ are identified and its threshold properties are discussed. When $\mathcal{R}_0 < 1$, the infection-free equilibrium…
▽ More
In this paper, we propose a mathematical model for HIV-1 infection with intracellular delay. The model examines a viral-therapy for controlling infections through recombining HIV-1 virus with a genetically modified virus. For this model, the basic reproduction number $\mathcal{R}_0$ are identified and its threshold properties are discussed. When $\mathcal{R}_0 < 1$, the infection-free equilibrium $E_0$ is globally asymptotically stable. When $\mathcal{R}_0 > 1$, $E_0$ becomes unstable and there occurs the single-infection equilibrium $E_s$, and $E_0$ and $E_s$ exchange their stability at the transcritical point $\mathcal{R}_0 =1$. If $1< \mathcal{R}_0 < R_1$, where $R_1$ is a positive constant explicitly depending on the model parameters, $E_s$ is globally asymptotically stable, while when $\mathcal{R}_0 > R_1$, $E_s$ loses its stability to the double-infection equilibrium $E_d$. There exist a constant $R_2$ such that $E_d$ is asymptotically stable if $R_1<\mathcal R_0 < R_2$, and $E_s$ and $E_d$ exchange their stability at the transcritical point $\mathcal{R}_0 =R_1$. We use one numerical example to determine the largest range of $\mathcal R_0$ for the local stability of $E_d$ and existence of Hopf bifurcation. Some simulations are performed to support the theoretical results. These results show that the delay plays an important role in determining the dynamic behaviour of the system. In the normal range of values, the delay may change the dynamic behaviour quantitatively, such as greatly reducing the amplitudes of oscillations, or even qualitatively changes the dynamical behaviour such as revoking oscillating solutions to equilibrium solutions. This suggests that the delay is a very important fact which should not be missed in HIV-1 modelling.
△ Less
Submitted 9 April, 2014; v1 submitted 16 March, 2014;
originally announced March 2014.
-
Knowledge-fused differential dependency network models for detecting significant rewiring in biological networks
Authors:
Ye Tian,
Bai Zhang,
Eric P. Hoffman,
Robert Clarke,
Zhen Zhang,
Ie-Ming Shih,
Jianhua Xuan,
David M. Herrington,
Yue Wang
Abstract:
Modeling biological networks serves as both a major goal and an effective tool of systems biology in studying mechanisms that orchestrate the activities of gene products in cells. Biological networks are context specific and dynamic in nature. To systematically characterize the selectively activated regulatory components and mechanisms, the modeling tools must be able to effectively distinguish si…
▽ More
Modeling biological networks serves as both a major goal and an effective tool of systems biology in studying mechanisms that orchestrate the activities of gene products in cells. Biological networks are context specific and dynamic in nature. To systematically characterize the selectively activated regulatory components and mechanisms, the modeling tools must be able to effectively distinguish significant rewiring from random background fluctuations. We formulated the inference of differential dependency networks that incorporates both conditional data and prior knowledge as a convex optimization problem, and developed an efficient learning algorithm to jointly infer the conserved biological network and the significant rewiring across different conditions. We used a novel sampling scheme to estimate the expected error rate due to random knowledge and based on which, developed a strategy that fully exploits the benefit of this data-knowledge integrated approach. We demonstrated and validated the principle and performance of our method using synthetic datasets. We then applied our method to yeast cell line and breast cancer microarray data and obtained biologically plausible results.
△ Less
Submitted 19 February, 2014; v1 submitted 28 October, 2013;
originally announced October 2013.