-
White matter tract crossing and bottleneck regions in the fetal brain
Authors:
Camilo Calixto,
Matheus D. Soldatelli,
Bo Li,
Lana Pierotich,
Ali Gholipour,
Simon K. Warfield,
Davood Karimi
Abstract:
There is a growing interest in using diffusion MRI to study the white matter tracts and structural connectivity of the fetal brain. Recent progress in data acquisition and processing suggests that this imaging modality has a unique role in elucidating the normal and abnormal patterns of neurodevelopment in utero. However, there have been no efforts to quantify the prevalence of crossing tracts and…
▽ More
There is a growing interest in using diffusion MRI to study the white matter tracts and structural connectivity of the fetal brain. Recent progress in data acquisition and processing suggests that this imaging modality has a unique role in elucidating the normal and abnormal patterns of neurodevelopment in utero. However, there have been no efforts to quantify the prevalence of crossing tracts and bottleneck regions, important issues that have been extensively researched for adult brains. In this work, we determined the brain regions with crossing tracts and bottlenecks between 23 and 36 gestational weeks. We performed probabilistic tractography on 59 fetal brain scans and extracted a set of 51 distinct white tracts, which we grouped into 10 major tract bundle groups. We analyzed the results to determine the patterns of tract crossings and bottlenecks. Our results showed that 20-25% of the white matter voxels included two or three crossing tracts. Bottlenecks were more prevalent. Between 75-80% of the voxels were characterized as bottlenecks, with more than 40% of the voxels involving four or more tracts. The results of this study highlight the challenge of fetal brain tractography and structural connectivity assessment and call for innovative image acquisition and analysis methods to mitigate these problems.
△ Less
Submitted 20 July, 2024;
originally announced August 2024.
-
Retrospective for the Dynamic Sensorium Competition for predicting large-scale mouse primary visual cortex activity from videos
Authors:
Polina Turishcheva,
Paul G. Fahey,
Michaela Vystrčilová,
Laura Hansel,
Rachel Froebe,
Kayla Ponder,
Yongrong Qiu,
Konstantin F. Willeke,
Mohammad Bashiri,
Ruslan Baikulov,
Yu Zhu,
Lei Ma,
Shan Yu,
Tiejun Huang,
Bryan M. Li,
Wolf De Wulf,
Nina Kudryashova,
Matthias H. Hennig,
Nathalie L. Rochefort,
Arno Onken,
Eric Wang,
Zhiwei Ding,
Andreas S. Tolias,
Fabian H. Sinz,
Alexander S Ecker
Abstract:
Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same ta…
▽ More
Understanding how biological visual systems process information is challenging because of the nonlinear relationship between visual input and neuronal responses. Artificial neural networks allow computational neuroscientists to create predictive models that connect biological and machine vision. Machine learning has benefited tremendously from benchmarks that compare different model on the same task under standardized conditions. However, there was no standardized benchmark to identify state-of-the-art dynamic models of the mouse visual system. To address this gap, we established the Sensorium 2023 Benchmark Competition with dynamic input, featuring a new large-scale dataset from the primary visual cortex of ten mice. This dataset includes responses from 78,853 neurons to 2 hours of dynamic stimuli per neuron, together with the behavioral measurements such as running speed, pupil dilation, and eye movements. The competition ranked models in two tracks based on predictive performance for neuronal responses on a held-out test set: one focusing on predicting in-domain natural stimuli and another on out-of-distribution (OOD) stimuli to assess model generalization. As part of the NeurIPS 2023 competition track, we received more than 160 model submissions from 22 teams. Several new architectures for predictive models were proposed, and the winning teams improved the previous state-of-the-art model by 50%. Access to the dataset as well as the benchmarking infrastructure will remain online at www.sensorium-competition.net.
△ Less
Submitted 12 July, 2024;
originally announced July 2024.
-
Leveraging the Human Ventral Visual Stream to Improve Neural Network Robustness
Authors:
Zhenan Shao,
Linjian Ma,
Bo Li,
Diane M. Beck
Abstract:
Human object recognition exhibits remarkable resilience in cluttered and dynamic visual environments. In contrast, despite their unparalleled performance across numerous visual tasks, Deep Neural Networks (DNNs) remain far less robust than humans, showing, for example, a surprising susceptibility to adversarial attacks involving image perturbations that are (almost) imperceptible to humans. Human…
▽ More
Human object recognition exhibits remarkable resilience in cluttered and dynamic visual environments. In contrast, despite their unparalleled performance across numerous visual tasks, Deep Neural Networks (DNNs) remain far less robust than humans, showing, for example, a surprising susceptibility to adversarial attacks involving image perturbations that are (almost) imperceptible to humans. Human object recognition likely owes its robustness, in part, to the increasingly resilient representations that emerge along the hierarchy of the ventral visual cortex. Here we show that DNNs, when guided by neural representations from a hierarchical sequence of regions in the human ventral visual stream, display increasing robustness to adversarial attacks. These neural-guided models also exhibit a gradual shift towards more human-like decision-making patterns and develop hierarchically smoother decision surfaces. Importantly, the resulting representational spaces differ in important ways from those produced by conventional smoothing methods, suggesting that such neural-guidance may provide previously unexplored robustness solutions. Our findings support the gradual emergence of human robustness along the ventral visual hierarchy and suggest that the key to DNN robustness may lie in increasing emulation of the human brain.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering
Authors:
Shujian Jiao,
Bingxuan Li,
Lei Wang,
Xiaojin Zhang,
Wei Chen,
Jiajie Peng,
Zhongyu Wei
Abstract:
Proteins are essential to life's processes, underpinning evolution and diversity. Advances in sequencing technology have revealed millions of proteins, underscoring the need for sophisticated pre-trained protein models for biological analysis and AI development. Facebook's ESM2, the most advanced protein language model to date, leverages a masked prediction task for unsupervised learning, crafting…
▽ More
Proteins are essential to life's processes, underpinning evolution and diversity. Advances in sequencing technology have revealed millions of proteins, underscoring the need for sophisticated pre-trained protein models for biological analysis and AI development. Facebook's ESM2, the most advanced protein language model to date, leverages a masked prediction task for unsupervised learning, crafting amino acid representations with notable biochemical accuracy. Yet, it lacks in delivering functional protein insights, signaling an opportunity for enhancing representation quality.Our study addresses this gap by incorporating protein family classification into ESM2's training.This approach, augmented with Community Propagation-Based Clustering Algorithm, improves global protein representations, while a contextual prediction task fine-tunes local amino acid accuracy. Significantly, our model achieved state-of-the-art results in several downstream experiments, demonstrating the power of combining global and local methodologies to substantially boost protein representation quality.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Benchmarking the CoW with the TopCoW Challenge: Topology-Aware Anatomical Segmentation of the Circle of Willis for CTA and MRA
Authors:
Kaiyuan Yang,
Fabio Musio,
Yihui Ma,
Norman Juchler,
Johannes C. Paetzold,
Rami Al-Maskari,
Luciano Höher,
Hongwei Bran Li,
Ibrahim Ethem Hamamci,
Anjany Sekuboyina,
Suprosanna Shit,
Houjing Huang,
Chinmay Prabhakar,
Ezequiel de la Rosa,
Diana Waldmannstetter,
Florian Kofler,
Fernando Navarro,
Martin Menten,
Ivan Ezhov,
Daniel Rueckert,
Iris Vos,
Ynte Ruigrok,
Birgitta Velthuis,
Hugo Kuijf,
Julien Hämmerli
, et al. (59 additional authors not shown)
Abstract:
The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modaliti…
▽ More
The Circle of Willis (CoW) is an important network of arteries connecting major circulations of the brain. Its vascular architecture is believed to affect the risk, severity, and clinical outcome of serious neuro-vascular diseases. However, characterizing the highly variable CoW anatomy is still a manual and time-consuming expert task. The CoW is usually imaged by two angiographic imaging modalities, magnetic resonance angiography (MRA) and computed tomography angiography (CTA), but there exist limited public datasets with annotations on CoW anatomy, especially for CTA. Therefore we organized the TopCoW Challenge in 2023 with the release of an annotated CoW dataset. The TopCoW dataset was the first public dataset with voxel-level annotations for thirteen possible CoW vessel components, enabled by virtual-reality (VR) technology. It was also the first large dataset with paired MRA and CTA from the same patients. TopCoW challenge formalized the CoW characterization problem as a multiclass anatomical segmentation task with an emphasis on topological metrics. We invited submissions worldwide for the CoW segmentation task, which attracted over 140 registered participants from four continents. The top performing teams managed to segment many CoW components to Dice scores around 90%, but with lower scores for communicating arteries and rare variants. There were also topological mistakes for predictions with high Dice scores. Additional topological analysis revealed further areas for improvement in detecting certain CoW components and matching CoW variant topology accurately. TopCoW represented a first attempt at benchmarking the CoW anatomical segmentation task for MRA and CTA, both morphologically and topologically.
△ Less
Submitted 29 April, 2024; v1 submitted 29 December, 2023;
originally announced December 2023.
-
A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images
Authors:
Ni Yao,
Hang Hu,
Kaicong Chen,
Chen Zhao,
Yuan Guo,
Boya Li,
Jiaofen Nan,
Yanting Li,
Chuang Han,
Fubao Zhu,
Weihua Zhou,
Li Tian
Abstract:
Objectives To develop and validate a deep learning-based diagnostic model incorporating uncertainty estimation so as to facilitate radiologists in the preoperative differentiation of the pathological subtypes of renal cell carcinoma (RCC) based on CT images. Methods Data from 668 consecutive patients, pathologically proven RCC, were retrospectively collected from Center 1. By using five-fold cross…
▽ More
Objectives To develop and validate a deep learning-based diagnostic model incorporating uncertainty estimation so as to facilitate radiologists in the preoperative differentiation of the pathological subtypes of renal cell carcinoma (RCC) based on CT images. Methods Data from 668 consecutive patients, pathologically proven RCC, were retrospectively collected from Center 1. By using five-fold cross-validation, a deep learning model incorporating uncertainty estimation was developed to classify RCC subtypes into clear cell RCC (ccRCC), papillary RCC (pRCC), and chromophobe RCC (chRCC). An external validation set of 78 patients from Center 2 further evaluated the model's performance. Results In the five-fold cross-validation, the model's area under the receiver operating characteristic curve (AUC) for the classification of ccRCC, pRCC, and chRCC was 0.868 (95% CI: 0.826-0.923), 0.846 (95% CI: 0.812-0.886), and 0.839 (95% CI: 0.802-0.88), respectively. In the external validation set, the AUCs were 0.856 (95% CI: 0.838-0.882), 0.787 (95% CI: 0.757-0.818), and 0.793 (95% CI: 0.758-0.831) for ccRCC, pRCC, and chRCC, respectively. Conclusions The developed deep learning model demonstrated robust performance in predicting the pathological subtypes of RCC, while the incorporated uncertainty emphasized the importance of understanding model confidence, which is crucial for assisting clinical decision-making for patients with renal tumors. Clinical relevance statement Our deep learning approach, integrated with uncertainty estimation, offers clinicians a dual advantage: accurate RCC subtype predictions complemented by diagnostic confidence references, promoting informed decision-making for patients with RCC.
△ Less
Submitted 12 November, 2023; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Synthetic CT Generation via Variant Invertible Network for All-digital Brain PET Attenuation Correction
Authors:
Yu Guan,
Bohui Shen,
Xinchong Shi,
Xiangsong Zhang,
Bingxuan Li,
Qiegen Liu
Abstract:
Attenuation correction (AC) is essential for the generation of artifact-free and quantitatively accurate positron emission tomography (PET) images. However, AC of PET faces challenges including inter-scan motion and erroneous transformation of structural voxel-intensities to PET attenuation-correction factors. Nowadays, the problem of AC for quantitative PET have been solved to a large extent afte…
▽ More
Attenuation correction (AC) is essential for the generation of artifact-free and quantitatively accurate positron emission tomography (PET) images. However, AC of PET faces challenges including inter-scan motion and erroneous transformation of structural voxel-intensities to PET attenuation-correction factors. Nowadays, the problem of AC for quantitative PET have been solved to a large extent after the commercial availability of devices combining PET with computed tomography (CT). Meanwhile, considering the feasibility of a deep learning approach for PET AC without anatomical imaging, this paper develops a PET AC method, which uses deep learning to generate continuously valued CT images from non-attenuation corrected PET images for AC on brain PET imaging. Specifically, an invertible network combined with the variable augmentation strategy that can achieve the bidirectional inference processes is proposed for synthetic CT generation (IVNAC). To evaluate the performance of the proposed algorithm, we conducted a comprehensive study on a total of 1440 data from 37 clinical patients using comparative algorithms (such as Cycle-GAN and Pix2pix). Perceptual analysis and quantitative evaluations illustrate that the invertible network for PET AC outperforms other existing AC models, which demonstrates the potential of the proposed method and the feasibility of achieving brain PET AC without CT.
△ Less
Submitted 3 October, 2023;
originally announced October 2023.
-
Infection-induced Cascading Failures -- Impact and Mitigation
Authors:
Bo Li,
David Saad
Abstract:
In the context of epidemic spreading, many intricate dynamical patterns can emerge due to the cooperation of different types of pathogens or the interaction between the disease spread and other failure propagation mechanism. To unravel such patterns, simulation frameworks are usually adopted, but they are computationally demanding on big networks and subject to large statistical uncertainty. Here,…
▽ More
In the context of epidemic spreading, many intricate dynamical patterns can emerge due to the cooperation of different types of pathogens or the interaction between the disease spread and other failure propagation mechanism. To unravel such patterns, simulation frameworks are usually adopted, but they are computationally demanding on big networks and subject to large statistical uncertainty. Here, we study the two-layer spreading processes on unidirectionally dependent networks, where the spreading infection of diseases or malware in one layer can trigger cascading failures in another layer and lead to secondary disasters, e.g., disrupting public services, supply chains, or power distribution. We utilize a dynamic message-passing method to devise efficient algorithms for inferring the system states, which allows one to investigate systematically the nature of complex intertwined spreading processes and evaluate their impact. Based on such dynamic message-passing framework and optimal control, we further develop an effective optimization algorithm for mitigating network failures.
△ Less
Submitted 5 May, 2024; v1 submitted 31 July, 2023;
originally announced July 2023.
-
The Brain Tumor Segmentation (BraTS-METS) Challenge 2023: Brain Metastasis Segmentation on Pre-treatment MRI
Authors:
Ahmed W. Moawad,
Anastasia Janas,
Ujjwal Baid,
Divya Ramakrishnan,
Rachit Saluja,
Nader Ashraf,
Leon Jekel,
Raisa Amiruddin,
Maruf Adewole,
Jake Albrecht,
Udunna Anazodo,
Sanjay Aneja,
Syed Muhammad Anwar,
Timothy Bergquist,
Evan Calabrese,
Veronica Chiang,
Verena Chung,
Gian Marco Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Juan Eugenio Iglesias,
Zhifan Jiang
, et al. (206 additional authors not shown)
Abstract:
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and chara…
▽ More
The translation of AI-generated brain metastases (BM) segmentation into clinical practice relies heavily on diverse, high-quality annotated medical imaging datasets. The BraTS-METS 2023 challenge has gained momentum for testing and benchmarking algorithms using rigorously annotated internationally compiled real-world datasets. This study presents the results of the segmentation challenge and characterizes the challenging cases that impacted the performance of the winning algorithms. Untreated brain metastases on standard anatomic MRI sequences (T1, T2, FLAIR, T1PG) from eight contributed international datasets were annotated in stepwise method: published UNET algorithms, student, neuroradiologist, final approver neuroradiologist. Segmentations were ranked based on lesion-wise Dice and Hausdorff distance (HD95) scores. False positives (FP) and false negatives (FN) were rigorously penalized, receiving a score of 0 for Dice and a fixed penalty of 374 for HD95. Eight datasets comprising 1303 studies were annotated, with 402 studies (3076 lesions) released on Synapse as publicly available datasets to challenge competitors. Additionally, 31 studies (139 lesions) were held out for validation, and 59 studies (218 lesions) were used for testing. Segmentation accuracy was measured as rank across subjects, with the winning team achieving a LesionWise mean score of 7.9. Common errors among the leading teams included false negatives for small lesions and misregistration of masks in space.The BraTS-METS 2023 challenge successfully curated well-annotated, diverse datasets and identified common errors, facilitating the translation of BM segmentation across varied clinical environments and providing personalized volumetric reports to patients undergoing BM treatment.
△ Less
Submitted 17 June, 2024; v1 submitted 1 June, 2023;
originally announced June 2023.
-
The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs)
Authors:
Anahita Fathi Kazerooni,
Nastaran Khalili,
Xinyang Liu,
Debanjan Haldar,
Zhifan Jiang,
Syed Muhammed Anwar,
Jake Albrecht,
Maruf Adewole,
Udunna Anazodo,
Hannah Anderson,
Sina Bagheri,
Ujjwal Baid,
Timothy Bergquist,
Austin J. Borja,
Evan Calabrese,
Verena Chung,
Gian-Marco Conte,
Farouk Dako,
James Eddy,
Ivan Ezhov,
Ariana Familiar,
Keyvan Farahani,
Shuvanjan Haldar,
Juan Eugenio Iglesias,
Anastasia Janas
, et al. (48 additional authors not shown)
Abstract:
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCA…
▽ More
Pediatric tumors of the central nervous system are the most common cause of cancer-related death in children. The five-year survival rate for high-grade gliomas in children is less than 20\%. Due to their rarity, the diagnosis of these entities is often delayed, their treatment is mainly based on historic treatment concepts, and clinical trials require multi-institutional collaborations. The MICCAI Brain Tumor Segmentation (BraTS) Challenge is a landmark community benchmark event with a successful history of 12 years of resource creation for the segmentation and analysis of adult glioma. Here we present the CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs 2023 challenge, which represents the first BraTS challenge focused on pediatric brain tumors with data acquired across multiple international consortia dedicated to pediatric neuro-oncology and clinical trials. The BraTS-PEDs 2023 challenge focuses on benchmarking the development of volumentric segmentation algorithms for pediatric brain glioma through standardized quantitative performance evaluation metrics utilized across the BraTS 2023 cluster of challenges. Models gaining knowledge from the BraTS-PEDs multi-parametric structural MRI (mpMRI) training data will be evaluated on separate validation and unseen test mpMRI dataof high-grade pediatric glioma. The CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs 2023 challenge brings together clinicians and AI/imaging scientists to lead to faster development of automated segmentation techniques that could benefit clinical trials, and ultimately the care of children with brain tumors.
△ Less
Submitted 23 May, 2024; v1 submitted 26 May, 2023;
originally announced May 2023.
-
Exact analysis of the subthreshold variability for conductance-based neuronal models with synchronous synaptic inputs
Authors:
Logan A. Becker,
Baowang Li,
Nicholas J. Priebe,
Eyal Seidemann,
Thibaud Taillefumier
Abstract:
The spiking activity of neocortical neurons exhibits a striking level of variability, even when these networks are driven by identical stimuli. The approximately Poisson firing of neurons has led to the hypothesis that these neural networks operate in the asynchronous state. In the asynchronous state neurons fire independently from one another, so that the probability that a neuron experience sync…
▽ More
The spiking activity of neocortical neurons exhibits a striking level of variability, even when these networks are driven by identical stimuli. The approximately Poisson firing of neurons has led to the hypothesis that these neural networks operate in the asynchronous state. In the asynchronous state neurons fire independently from one another, so that the probability that a neuron experience synchronous synaptic inputs is exceedingly low. While the models of asynchronous neurons lead to observed spiking variability, it is not clear whether the asynchronous state can also account for the level of subthreshold membrane potential variability. We propose a new analytical framework to rigorously quantify the subthreshold variability of a single conductance-based neuron in response to synaptic inputs with prescribed degrees of synchrony. Technically we leverage the theory of exchangeability to model input synchrony via jump-process-based synaptic drives; we then perform a moment analysis of the stationary response of a neuronal model with all-or-none conductances that neglects post-spiking reset. As a result, we produce exact, interpretable closed forms for the first two stationary moments of the membrane voltage, with explicit dependence on the input synaptic numbers, strengths, and synchrony. For biophysically relevant parameters, we find that the asynchronous regime only yields realistic subthreshold variability (voltage variance $\simeq 4-9\mathrm{mV^2}$) when driven by a restricted number of large synapses, compatible with strong thalamic drive. By contrast, we find that achieving realistic subthreshold variability with dense cortico-cortical inputs requires including weak but nonzero input synchrony, consistent with measured pairwise spiking correlations.
△ Less
Submitted 28 December, 2023; v1 submitted 18 April, 2023;
originally announced April 2023.
-
A Multimodal Graph Neural Network Framework of Cancer Molecular Subtype Classification
Authors:
Bingjun Li,
Sheida Nabavi
Abstract:
The recent development of high-throughput sequencing creates a large collection of multi-omics data, which enables researchers to better investigate cancer molecular profiles and cancer taxonomy based on molecular subtypes. Integrating multi-omics data has been proven to be effective for building more precise classification models. Current multi-omics integrative models mainly use early fusion by…
▽ More
The recent development of high-throughput sequencing creates a large collection of multi-omics data, which enables researchers to better investigate cancer molecular profiles and cancer taxonomy based on molecular subtypes. Integrating multi-omics data has been proven to be effective for building more precise classification models. Current multi-omics integrative models mainly use early fusion by concatenation or late fusion based on deep neural networks. Due to the nature of biological systems, graphs are a better representation of bio-medical data. Although few graph neural network (GNN) based multi-omics integrative methods have been proposed, they suffer from three common disadvantages. One is most of them use only one type of connection, either inter-omics or intra-omic connection; second, they only consider one kind of GNN layer, either graph convolution network (GCN) or graph attention network (GAT); and third, most of these methods lack testing on a more complex cancer classification task. We propose a novel end-to-end multi-omics GNN framework for accurate and robust cancer subtype classification. The proposed model utilizes multi-omics data in the form of heterogeneous multi-layer graphs that combines both inter-omics and intra-omic connections from established biological knowledge. The proposed model incorporates learned graph features and global genome features for accurate classification. We test the proposed model on TCGA Pan-cancer dataset and TCGA breast cancer dataset for molecular subtype and cancer subtype classification, respectively. The proposed model outperforms four current state-of-the-art baseline models in multiple evaluation metrics. The comparative analysis of GAT-based models and GCN-based models reveals that GAT-based models are preferred for smaller graphs with less information and GCN-based models are preferred for larger graphs with extra information.
△ Less
Submitted 23 January, 2024; v1 submitted 24 February, 2023;
originally announced February 2023.
-
The body image of social robots
Authors:
Bing Li,
Oumayma Ajjaji,
Robin Gigandet,
Tatjana Nazir
Abstract:
The rapid development of social robots has challenged robotics and cognitive sciences to understand humans' perception of the appearance of robots. In this study, robot-associated words spontaneously generated by humans were analyzed to semantically reveal the body image of 30 robots that have been developed over the past decades. The analyses took advantage of word affect scales and embedding vec…
▽ More
The rapid development of social robots has challenged robotics and cognitive sciences to understand humans' perception of the appearance of robots. In this study, robot-associated words spontaneously generated by humans were analyzed to semantically reveal the body image of 30 robots that have been developed over the past decades. The analyses took advantage of word affect scales and embedding vectors, and provided a series of evidence for links between human perception and body image. It was found that the valence and dominance of the body image reflected humans' attitude towards the general concept of robots; that the user bases and usages of the robots were among the primary factors influencing humans' impressions towards individual robots; and that there was a relationship between the robots' affects and semantic distances to the word ``person''. According to the results, building body image for robots was an effective paradigm to investigate which features were appreciated by people and what influenced people's feelings towards robots.
△ Less
Submitted 6 February, 2023;
originally announced February 2023.
-
V1T: large-scale mouse V1 response prediction using a Vision Transformer
Authors:
Bryan M. Li,
Isabel M. Cornacchia,
Nathalie L. Rochefort,
Arno Onken
Abstract:
Accurate predictive models of the visual cortex neural response to natural visual stimuli remain a challenge in computational neuroscience. In this work, we introduce V1T, a novel Vision Transformer based architecture that learns a shared visual and behavioral representation across animals. We evaluate our model on two large datasets recorded from mouse primary visual cortex and outperform previou…
▽ More
Accurate predictive models of the visual cortex neural response to natural visual stimuli remain a challenge in computational neuroscience. In this work, we introduce V1T, a novel Vision Transformer based architecture that learns a shared visual and behavioral representation across animals. We evaluate our model on two large datasets recorded from mouse primary visual cortex and outperform previous convolution-based models by more than 12.7% in prediction performance. Moreover, we show that the self-attention weights learned by the Transformer correlate with the population receptive fields. Our model thus sets a new benchmark for neural response prediction and can be used jointly with behavioral and neural recordings to reveal meaningful characteristic features of the visual cortex.
△ Less
Submitted 5 September, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
Kainate receptor modulation by NETO2
Authors:
Lingli He,
Jiahui Sun,
Yiwei Gao,
Bin Li,
Yuhang Wang,
Yanli Dong,
Weidong An,
Hang Li,
Bei Yang,
Yuhan Ge,
Xuejun Cai Zhang,
Yun Stone Shi,
Yan Zhao
Abstract:
Glutamate-gated kainate receptors (KARs) are ubiquitous in the central nervous system of vertebrates, mediate synaptic transmission on post-synapse, and modulate transmitter release on pre-synapse. In the brain, the trafficking, gating kinetics, and pharmacology of KARs are tightly regulated by Neuropilin and tolloid-like proteins (Netos). Here we report cryo-EM structures of homo-tetrameric GluK2…
▽ More
Glutamate-gated kainate receptors (KARs) are ubiquitous in the central nervous system of vertebrates, mediate synaptic transmission on post-synapse, and modulate transmitter release on pre-synapse. In the brain, the trafficking, gating kinetics, and pharmacology of KARs are tightly regulated by Neuropilin and tolloid-like proteins (Netos). Here we report cryo-EM structures of homo-tetrameric GluK2 in complex with Neto2 at inhibited and desensitized states, illustrating variable stoichiometry of GluK2-Neto2 complexes, with one or two Neto2 subunits associate with the GluK2. We find that Neto2 accesses only two broad faces of KARs, intermolecularly crosslinking the lower-lobe of ATDA/C, upper-lobe of LBDB/D, and lower-lobe of LBDA/C, illustrating how Neto2 regulates receptor-gating kinetics. The transmembrane helix of Neto2 is positioned proximal to the selectivity filter and competes with the amphiphilic H1-helix after M4 for interacting with an ICD formed by the M1-M2 linkers of the receptor, revealing how rectification is regulated by Neto2.
△ Less
Submitted 2 February, 2023;
originally announced February 2023.
-
Forecasting West Nile Virus with Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data
Authors:
Adam Tonks,
Trevor Harris,
Bo Li,
William Brown,
Rebecca Smith
Abstract:
Machine learning methods have seen increased application to geospatial environmental problems, such as precipitation nowcasting, haze forecasting, and crop yield prediction. However, many of the machine learning methods applied to mosquito population and disease forecasting do not inherently take into account the underlying spatial structure of the given data. In our work, we apply a spatially awa…
▽ More
Machine learning methods have seen increased application to geospatial environmental problems, such as precipitation nowcasting, haze forecasting, and crop yield prediction. However, many of the machine learning methods applied to mosquito population and disease forecasting do not inherently take into account the underlying spatial structure of the given data. In our work, we apply a spatially aware graph neural network model consisting of GraphSAGE layers to forecast the presence of West Nile virus in Illinois, to aid mosquito surveillance and abatement efforts within the state. More generally, we show that graph neural networks applied to irregularly sampled geospatial data can exceed the performance of a range of baseline methods including logistic regression, XGBoost, and fully-connected neural networks.
△ Less
Submitted 21 December, 2022;
originally announced December 2022.
-
BatmanNet: Bi-branch Masked Graph Transformer Autoencoder for Molecular Representation
Authors:
Zhen Wang,
Zheng Feng,
Yanjun Li,
Bowen Li,
Yongrui Wang,
Chulin Sha,
Min He,
Xiaolin Li
Abstract:
Although substantial efforts have been made using graph neural networks (GNNs) for AI-driven drug discovery (AIDD), effective molecular representation learning remains an open challenge, especially in the case of insufficient labeled molecules. Recent studies suggest that big GNN models pre-trained by self-supervised learning on unlabeled datasets enable better transfer performance in downstream m…
▽ More
Although substantial efforts have been made using graph neural networks (GNNs) for AI-driven drug discovery (AIDD), effective molecular representation learning remains an open challenge, especially in the case of insufficient labeled molecules. Recent studies suggest that big GNN models pre-trained by self-supervised learning on unlabeled datasets enable better transfer performance in downstream molecular property prediction tasks. However, the approaches in these studies require multiple complex self-supervised tasks and large-scale datasets, which are time-consuming, computationally expensive, and difficult to pre-train end-to-end. Here, we design a simple yet effective self-supervised strategy to simultaneously learn local and global information about molecules, and further propose a novel bi-branch masked graph transformer autoencoder (BatmanNet) to learn molecular representations. BatmanNet features two tailored complementary and asymmetric graph autoencoders to reconstruct the missing nodes and edges, respectively, from a masked molecular graph. With this design, BatmanNet can effectively capture the underlying structure and semantic information of molecules, thus improving the performance of molecular representation. BatmanNet achieves state-of-the-art results for multiple drug discovery tasks, including molecular properties prediction, drug-drug interaction, and drug-target interaction, on 13 benchmark datasets, demonstrating its great potential and superiority in molecular representation learning.
△ Less
Submitted 5 November, 2023; v1 submitted 25 November, 2022;
originally announced November 2022.
-
Can Brain Signals Reveal Inner Alignment with Human Languages?
Authors:
William Han,
Jielin Qiu,
Jiacheng Zhu,
Mengdi Xu,
Douglas Weber,
Bo Li,
Ding Zhao
Abstract:
Brain Signals, such as Electroencephalography (EEG), and human languages have been widely explored independently for many downstream tasks, however, the connection between them has not been well explored. In this study, we explore the relationship and dependency between EEG and language. To study at the representation level, we introduced \textbf{MTAM}, a \textbf{M}ultimodal \textbf{T}ransformer \…
▽ More
Brain Signals, such as Electroencephalography (EEG), and human languages have been widely explored independently for many downstream tasks, however, the connection between them has not been well explored. In this study, we explore the relationship and dependency between EEG and language. To study at the representation level, we introduced \textbf{MTAM}, a \textbf{M}ultimodal \textbf{T}ransformer \textbf{A}lignment \textbf{M}odel, to observe coordinated representations between the two modalities. We used various relationship alignment-seeking techniques, such as Canonical Correlation Analysis and Wasserstein Distance, as loss functions to transfigure features. On downstream applications, sentiment analysis and relation detection, we achieved new state-of-the-art results on two datasets, ZuCo and K-EmoCon. Our method achieved an F1-score improvement of 1.7% on K-EmoCon and 9.3% on Zuco datasets for sentiment analysis, and 7.4% on ZuCo for relation detection. In addition, we provide interpretations of the performance improvement: (1) feature distribution shows the effectiveness of the alignment module for discovering and encoding the relationship between EEG and language; (2) alignment weights show the influence of different language semantics as well as EEG frequency features; (3) brain topographical maps provide an intuitive demonstration of the connectivity in the brain regions. Our code is available at \url{https://github.com/Jason-Qiu/EEG_Language_Alignment}.
△ Less
Submitted 4 May, 2024; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Graph neural networks and attention-based CNN-LSTM for protein classification
Authors:
Zhuangwei Shi,
Bo Li
Abstract:
This paper focuses on three critical problems on protein classification. Firstly, Carbohydrate-active enzyme (CAZyme) classification can help people to understand the properties of enzymes. However, one CAZyme may belong to several classes. This leads to Multi-label CAZyme classification. Secondly, to capture information from the secondary structure of protein, protein classification is modeled as…
▽ More
This paper focuses on three critical problems on protein classification. Firstly, Carbohydrate-active enzyme (CAZyme) classification can help people to understand the properties of enzymes. However, one CAZyme may belong to several classes. This leads to Multi-label CAZyme classification. Secondly, to capture information from the secondary structure of protein, protein classification is modeled as graph classification problem. Thirdly, compound-protein interactions prediction employs graph learning for compound with sequential embedding for protein. This can be seen as classification task for compound-protein pairs. This paper proposes three models for protein classification. Firstly, this paper proposes a Multi-label CAZyme classification model using CNN-LSTM with Attention mechanism. Secondly, this paper proposes a variational graph autoencoder based subspace learning model for protein graph classification. Thirdly, this paper proposes graph isomorphism networks (GIN) and Attention-based CNN-LSTM for compound-protein interactions prediction, as well as comparing GIN with graph convolution networks (GCN) and graph attention networks (GAT) in this task. The proposed models are effective for protein classification. Source code and data are available at https://github.com/zshicode/GNN-AttCL-protein. Besides, this repository collects and collates the benchmark datasets with respect to above problems, including CAZyme classification, enzyme protein graph classification, compound-protein interactions prediction, drug-target affinities prediction and drug-drug interactions prediction. Hence, the usage for evaluation by benchmark datasets can be more conveniently.
△ Less
Submitted 22 February, 2023; v1 submitted 20 April, 2022;
originally announced April 2022.
-
Data-Efficient Graph Grammar Learning for Molecular Generation
Authors:
Minghao Guo,
Veronika Thost,
Beichen Li,
Payel Das,
Jie Chen,
Wojciech Matusik
Abstract:
The problem of molecular generation has received significant attention recently. Existing methods are typically based on deep neural networks and require training on large datasets with tens of thousands of samples. In practice, however, the size of class-specific chemical datasets is usually limited (e.g., dozens of samples) due to labor-intensive experimentation and data collection. This present…
▽ More
The problem of molecular generation has received significant attention recently. Existing methods are typically based on deep neural networks and require training on large datasets with tens of thousands of samples. In practice, however, the size of class-specific chemical datasets is usually limited (e.g., dozens of samples) due to labor-intensive experimentation and data collection. This presents a considerable challenge for the deep learning generative models to comprehensively describe the molecular design space. Another major challenge is to generate only physically synthesizable molecules. This is a non-trivial task for neural network-based generative models since the relevant chemical knowledge can only be extracted and generalized from the limited training data. In this work, we propose a data-efficient generative model that can be learned from datasets with orders of magnitude smaller sizes than common benchmarks. At the heart of this method is a learnable graph grammar that generates molecules from a sequence of production rules. Without any human assistance, these production rules are automatically constructed from training data. Furthermore, additional chemical knowledge can be incorporated in the model by further grammar optimization. Our learned graph grammar yields state-of-the-art results on generating high-quality molecules for three monomer datasets that contain only ${\sim}20$ samples each. Our approach also achieves remarkable performance in a challenging polymer generation task with only $117$ training samples and is competitive against existing methods using $81$k data points. Code is available at https://github.com/gmh14/data_efficient_grammar.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
Periodic Traveling Waves in an Integro-Difference Equation With Non-Monotonic Growth and Strong Allee Effect
Authors:
Michael Nestor,
Bingtuan Li
Abstract:
We derive sufficient conditions for the existence of a periodic traveling wave solution to an integro-difference equation with a piecewise constant growth function exhibiting a stable period2 cycle and strong Allee effect. The mean traveling wave speed is shown to be the asymptotic spreading speed of solutions with compactly supported initial data under appropriate conditions. We then conduct case…
▽ More
We derive sufficient conditions for the existence of a periodic traveling wave solution to an integro-difference equation with a piecewise constant growth function exhibiting a stable period2 cycle and strong Allee effect. The mean traveling wave speed is shown to be the asymptotic spreading speed of solutions with compactly supported initial data under appropriate conditions. We then conduct case studies for the Laplace kernel and uniform kernel.
△ Less
Submitted 6 September, 2024; v1 submitted 1 February, 2022;
originally announced February 2022.
-
Neuronal Learning Analysis using Cycle-Consistent Adversarial Networks
Authors:
Bryan M. Li,
Theoklitos Amvrosiadis,
Nathalie Rochefort,
Arno Onken
Abstract:
Understanding how activity in neural circuits reshapes following task learning could reveal fundamental mechanisms of learning. Thanks to the recent advances in neural imaging technologies, high-quality recordings can be obtained from hundreds of neurons over multiple days or even weeks. However, the complexity and dimensionality of population responses pose significant challenges for analysis. Ex…
▽ More
Understanding how activity in neural circuits reshapes following task learning could reveal fundamental mechanisms of learning. Thanks to the recent advances in neural imaging technologies, high-quality recordings can be obtained from hundreds of neurons over multiple days or even weeks. However, the complexity and dimensionality of population responses pose significant challenges for analysis. Existing methods of studying neuronal adaptation and learning often impose strong assumptions on the data or model, resulting in biased descriptions that do not generalize. In this work, we use a variant of deep generative models called - CycleGAN, to learn the unknown mapping between pre- and post-learning neural activities recorded $\textit{in vivo}$. We develop an end-to-end pipeline to preprocess, train and evaluate calcium fluorescence signals, and a procedure to interpret the resulting deep learning models. To assess the validity of our method, we first test our framework on a synthetic dataset with known ground-truth transformation. Subsequently, we applied our method to neural activities recorded from the primary visual cortex of behaving mice, where the mice transition from novice to expert-level performance in a visual-based virtual reality experiment. We evaluate model performance on generated calcium signals and their inferred spike trains. To maximize performance, we derive a novel approach to pre-sort neurons such that convolutional-based networks can take advantage of the spatial information that exists in neural activities. In addition, we incorporate visual explanation methods to improve the interpretability of our work and gain insights into the learning process as manifested in the cellular activities. Together, our results demonstrate that analyzing neuronal learning processes with data-driven deep unsupervised methods holds the potential to unravel changes in an unbiased way.
△ Less
Submitted 25 November, 2021;
originally announced November 2021.
-
Game-environment feedback dynamics for voluntary prisoner's dilemma games
Authors:
Bin-Quan Li,
Cong Liu,
Zhi-Xi Wu,
Jian-Yue Guan
Abstract:
Recently, the eco-evolutionary game theory which describes the coupled dynamics of strategies and environment have attracted great attention. At the same time, most of the current work is focused on the classic two-player two-strategy game. In this work, we study multi-strategy eco-evolutionary game theory which is an extension of the framework. For simplicity, we'll focus on the voluntary partici…
▽ More
Recently, the eco-evolutionary game theory which describes the coupled dynamics of strategies and environment have attracted great attention. At the same time, most of the current work is focused on the classic two-player two-strategy game. In this work, we study multi-strategy eco-evolutionary game theory which is an extension of the framework. For simplicity, we'll focus on the voluntary participation Prisoner's dilemma game. For the general class of payoff-dependent feedback dynamics, we show the conditions for the existence and stability of internal equilibrium by using the replicator dynamics, respectively. Where internal equilibrium points, such as, two-strategy coexistence states, three-strategy coexistence states, persistent oscillation states and interior saddle points. These states are determined by the relative feedback strength and payoff matrix, and are independent of the relative feedback speed and initial state. In particular, the three-strategy coexistence provides a new mechanism for maintaining biodiversity in biology, ecology, and sociology. Besides, we find that this three-strategy model return to the persistent oscillation state of the two-strategy model when there is no defective strategy at the initial moment.
△ Less
Submitted 18 November, 2021;
originally announced November 2021.
-
A Cloud connected NO2 and Ozone Sensor System for Personalized Pediatric Asthma Research and Management
Authors:
Quan Dong,
Baichen Li,
R. Scott Downen,
Nam Tran,
Elizabeth Chorvinsky,
Dinesh K. Pillai,
Mona E. Zaghloul,
Zhenyu Li
Abstract:
This paper presents a cloud-connected indoor air quality sensor system that can be deployed to patients' homes to study personal microenvironmental exposure for asthma research and management. The system consists of multiple compact sensor units that can measure residential NO2, ozone, humidity, and temperature at one minute resolution and a cloud based informatic system that acquires, stores, and…
▽ More
This paper presents a cloud-connected indoor air quality sensor system that can be deployed to patients' homes to study personal microenvironmental exposure for asthma research and management. The system consists of multiple compact sensor units that can measure residential NO2, ozone, humidity, and temperature at one minute resolution and a cloud based informatic system that acquires, stores, and visualizes the microenvironmental data in real time. The sensor hardware can measure NO2 as low as 10 ppb and ozone at 15 ppb. The cloud informatic system is implemented using open-source software on Amazon Web Service for easy deployment and scalability. This system was successfully deployed to pediatric asthma patients' homes in a pilot study. In this study, we discovered that some families can have short term NO2 exposure higher than EPA's one hour exposure limit (100 ppb), and NO2 micropollution episodes often arise from natural gas appliance usage such as gas stove burning during cooking. By combining the personalized air pollutant exposure measurements with the physiological responses from a patient diary and medical record, this system can enable novel asthma research and personalized asthma management.
△ Less
Submitted 8 January, 2021;
originally announced January 2021.
-
Impact of presymptomatic transmission on epidemic spreading in contact networks: A dynamic message-passing analysis
Authors:
Bo Li,
David Saad
Abstract:
Infectious diseases that incorporate pre-symptomatic transmission are challenging to monitor, model, predict and contain. We address this scenario by studying a variant of a stochastic susceptible-exposed-infected-recovered model on arbitrary network instances using an analytical framework based on the method of dynamic message-passing. This framework provides a good estimate of the probabilistic…
▽ More
Infectious diseases that incorporate pre-symptomatic transmission are challenging to monitor, model, predict and contain. We address this scenario by studying a variant of a stochastic susceptible-exposed-infected-recovered model on arbitrary network instances using an analytical framework based on the method of dynamic message-passing. This framework provides a good estimate of the probabilistic evolution of the spread on both static and contact networks, offering a significantly improved accuracy with respect to individual-based mean-field approaches while requiring a much lower computational cost compared to numerical simulations. It facilitates the derivation of epidemic thresholds, which are phase boundaries separating parameter regimes where infections can be effectively contained from those where they cannot. These have clear implications on different containment strategies through topological (reducing contacts) and infection parameter changes (e.g., social distancing and wearing face masks), with relevance to the recent COVID-19 pandemic.
△ Less
Submitted 6 May, 2021; v1 submitted 27 October, 2020;
originally announced October 2020.
-
A multilayer interstitial fluid flow along vascular adventitia
Authors:
Hongyi Li,
You Lv,
Xiaoliang Chen,
Bei Li,
Qi Hua,
Fusui Ji,
Yajun Yin,
Hua Li
Abstract:
Objective: Interstitial fluid flow through vascular adventitia has been disclosed recently. However, its kinetic pattern was unclear. Methods and Results: We used histological and topographical identifications to observe ISF flow along venous vessels in rabbits. By MRI in alive subjects, the inherent ISF flow pathways in legs, abdomen and thorax were enhanced by paramagnetic contrast from ankle de…
▽ More
Objective: Interstitial fluid flow through vascular adventitia has been disclosed recently. However, its kinetic pattern was unclear. Methods and Results: We used histological and topographical identifications to observe ISF flow along venous vessels in rabbits. By MRI in alive subjects, the inherent ISF flow pathways in legs, abdomen and thorax were enhanced by paramagnetic contrast from ankle dermis. By fluorescence stereomicroscopy and layer-by-layer dissection after the rabbits were sacrificed, the perivascular and adventitial connective tissues (PACT) along the saphenous veins and inferior vena cava were found to be stained by sodium fluorescein from ankle dermis, which coincided with the findings by MRI. By confocal microscopy and histological analysis, the stained PACT pathways were verified to be the fibrous connective tissues and consisted of longitudinally assembled fibers. By usages of nanoparticles and surfactants, a PACT pathway was found to be accessible for a nanoparticle under 100nm and contain two parts: a tunica channel part and an absorptive part. In real-time observations, the calculated velocity of a continuous ISF flow along fibers of a PACT pathway was 3.6-15.6 mm/sec. Conclusion: These data further revealed more kinetic features of a continuous ISF flow along vascular vessel. A multiscale, multilayer, and multiform interstitial/interfacial fluid flow throughout perivascular and adventitial connective tissues was suggested as one of kinetic and dynamic mechanisms for ISF flow, which might be another principal fluid dynamic pattern besides convective/vascular and diffusive transport in biological system.
△ Less
Submitted 23 September, 2020; v1 submitted 23 September, 2020;
originally announced September 2020.
-
Synthesising Realistic Calcium Traces of Neuronal Populations Using GAN
Authors:
Bryan M. Li,
Theoklitos Amvrosiadis,
Nathalie Rochefort,
Arno Onken
Abstract:
Calcium imaging has become a powerful and popular technique to monitor the activity of large populations of neurons in vivo. However, for ethical considerations and despite recent technical developments, recordings are still constrained to a limited number of trials and animals. This limits the amount of data available from individual experiments and hinders the development of analysis techniques…
▽ More
Calcium imaging has become a powerful and popular technique to monitor the activity of large populations of neurons in vivo. However, for ethical considerations and despite recent technical developments, recordings are still constrained to a limited number of trials and animals. This limits the amount of data available from individual experiments and hinders the development of analysis techniques and models for more realistic sizes of neuronal populations. The ability to artificially synthesize realistic neuronal calcium signals could greatly alleviate this problem by scaling up the number of trials. Here, we propose a Generative Adversarial Network (GAN) model to generate realistic calcium signals as seen in neuronal somata with calcium imaging. To this end, we propose CalciumGAN, a model based on the WaveGAN architecture and train it on calcium fluorescent signals with the Wasserstein distance. We test the model on artificial data with known ground-truth and show that the distribution of the generated signals closely resembles the underlying data distribution. Then, we train the model on real calcium traces recorded from the primary visual cortex of behaving mice and confirm that the deconvolved spike trains match the statistics of the recorded data. Together, these results demonstrate that our model can successfully generate realistic calcium traces, thereby providing the means to augment existing datasets of neuronal activity for enhanced data exploration and modelling.
△ Less
Submitted 4 February, 2023; v1 submitted 6 September, 2020;
originally announced September 2020.
-
Identification and Validation of the SNV Biomarkers Based on Multi-Dimensional Patterns
Authors:
Bo Li,
Junying Zhang,
Liang Yu
Abstract:
Background: Single nucleotide variants (SNVs) are detected as different distributions of DNA samples of distinct types of cancer patients. Even though, it is an exacting task to select the appropriate method to identify cancer to the greatest extent of SNVs. Results: In this paper, we proposed a biomarker concept based on SNV patterns in different feature dimensions. Raw dataset (2761 samples) con…
▽ More
Background: Single nucleotide variants (SNVs) are detected as different distributions of DNA samples of distinct types of cancer patients. Even though, it is an exacting task to select the appropriate method to identify cancer to the greatest extent of SNVs. Results: In this paper, we proposed a biomarker concept based on SNV patterns in different feature dimensions. Raw dataset (2761 samples) consisting of twelve different cancers was obtained from TCGA (The Cancer Genome Atlas). After preliminary screening of 562,321 DNA mutation sites in the samples, the mutation sites were extracted and characterized by cancer types in six different SNV feature dimensions. In this study, we found that the extracted features showed similar distribution in the cluster center of the disease type of the samples. After the initial processing of the raw data, the sample was more focused on the subtype distribution of the cancer or the cancer at the SNV level. We used k-nearest neighbors (KNN) to classify the extracted features and Leave-One-Out cross verified them. The accuracy of classifying is stable at around 97% and reached 97.43% at the highest. During the validation phase, we found validated oncogenes in the loci of the features with the highest importance among nine cancers. Conclusions: In summary, the samples showed consistent patterns according to the cancer in which it belongs. It is feasible to classify the cancer of the sample by the distribution of different dimensions of the SNVs and has a high accuracy. And has potential implications for the discovery of cancer-causing genes.
△ Less
Submitted 24 February, 2020;
originally announced February 2020.
-
A Discreet Wearable IoT Sensor for Continuous Transdermal Alcohol Monitoring -- Challenges and Opportunities
Authors:
Baichen Li,
Scott R. Downen,
Quan Dong,
Nam Tran,
Maxine LeSaux,
Andrew C. Meltzer,
Zhenyu Li
Abstract:
Non-invasive continuous alcohol monitoring has potential applications in both population research and in clinical management of acute alcohol intoxication or chronic alcoholism. Current wearable monitors based on transdermal alcohol content (TAC) sensing are relatively bulky and have limited quantification accuracy. Here we describe the development of a discreet wearable transdermal alcohol (TAC)…
▽ More
Non-invasive continuous alcohol monitoring has potential applications in both population research and in clinical management of acute alcohol intoxication or chronic alcoholism. Current wearable monitors based on transdermal alcohol content (TAC) sensing are relatively bulky and have limited quantification accuracy. Here we describe the development of a discreet wearable transdermal alcohol (TAC) sensor in the form of a wristband or armband. This novel sensor can detect vapor-phase alcohol in perspiration from 0.09 ppm (equivalent to 0.09 mg/dL sweat alcohol concentration at 25 °C under Henry's Law equilibrium) to over 500 ppm at one-minute time resolution. The TAC sensor is powered by a 110 mAh lithium battery that lasts for over 7 days. In addition, the sensor can function as a medical "internet-of-things" (IoT) device by connecting to an Android smartphone gateway via Bluetooth Low Energy (BLE) and upload data to a cloud informatics system. Such wearable IoT sensors may enable large-scale alcohol-related research and personalized management. We also present evidence suggesting a hypothesis that perspiration rate is the dominant factor leading to TAC measurement variabilities, which may inform more reproducible and accurate TAC sensor designs in the future.
△ Less
Submitted 13 November, 2019;
originally announced November 2019.
-
Neural networks with motivation
Authors:
Sergey A. Shuvaev,
Ngoc B. Tran,
Marcus Stephenson-Jones,
Bo Li,
Alexei A. Koulakov
Abstract:
How can animals behave effectively in conditions involving different motivational contexts? Here, we propose how reinforcement learning neural networks can learn optimal behavior for dynamically changing motivational salience vectors. First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards. Second, we show that such networks can learn complex…
▽ More
How can animals behave effectively in conditions involving different motivational contexts? Here, we propose how reinforcement learning neural networks can learn optimal behavior for dynamically changing motivational salience vectors. First, we show that Q-learning neural networks with motivation can navigate in environment with dynamic rewards. Second, we show that such networks can learn complex behaviors simultaneously directed towards several goals distributed in an environment. Finally, we show that in Pavlovian conditioning task, the responses of the neurons in our model resemble the firing patterns of neurons in the ventral pallidum (VP), a basal ganglia structure involved in motivated behaviors. We show that, similarly to real neurons, recurrent networks with motivation are composed of two oppositely-tuned classes of neurons, responding to positive and negative rewards. Our model generates predictions for the VP connectivity. We conclude that networks with motivation can rapidly adapt their behavior to varying conditions without changes in synaptic strength when expected reward is modulated by motivation. Such networks may also provide a mechanism for how hierarchical reinforcement learning is implemented in the brain.
△ Less
Submitted 18 November, 2019; v1 submitted 22 June, 2019;
originally announced June 2019.
-
Sequential Bayesian Detection of Spike Activities from Fluorescence Observations
Authors:
Zhuangkun Wei,
Bin Li,
Weisi Guo,
Wenxiu Hu,
Chenglin Zhao
Abstract:
Extracting and detecting spike activities from the fluorescence observations is an important step in understanding how neuron systems work. The main challenge lies in that the combination of the ambient noise with dynamic baseline fluctuation, often contaminates the observations, thereby deteriorating the reliability of spike detection. This may be even worse in the face of the nonlinear biologica…
▽ More
Extracting and detecting spike activities from the fluorescence observations is an important step in understanding how neuron systems work. The main challenge lies in that the combination of the ambient noise with dynamic baseline fluctuation, often contaminates the observations, thereby deteriorating the reliability of spike detection. This may be even worse in the face of the nonlinear biological process, the coupling interactions between spikes and baseline, and the unknown critical parameters of an underlying physiological model, in which erroneous estimations of parameters will affect the detection of spikes causing further error propagation. In this paper, we propose a random finite set (RFS) based Bayesian approach. The dynamic behaviors of spike sequence, fluctuated baseline and unknown parameters are formulated as one RFS. This RFS state is capable of distinguishing the hidden active/silent states induced by spike and non-spike activities respectively, thereby \emph{negating the interaction role} played by spikes and other factors. Then, premised on the RFS states, a Bayesian inference scheme is designed to simultaneously estimate the model parameters, baseline, and crucial spike activities. Our results demonstrate that the proposed scheme can gain an extra $12\%$ detection accuracy in comparison with the state-of-the-art MLSpike method.
△ Less
Submitted 31 January, 2019;
originally announced January 2019.
-
A Wearable IoT Aldehyde Sensor for Pediatric Asthma Research and Management
Authors:
Baichen Li,
Quan Dong,
R. Scott Downen,
Nam Tran,
J. Hunter Jackson,
Dinesh Pillai,
Mona Zaghloul,
Zhenyu Li
Abstract:
Mechanistic studies of pediatric asthma require objective measures of environmental exposure metrics correlated with physiological responses. Here we report a cloud-based wearable IoT sensor system which can measure an asthma patient's exposure to aldehydes, a known class of airway irritants, in real-life settings. The wrist-watch form sensor measures formaldehyde levels in air using fuel cell tec…
▽ More
Mechanistic studies of pediatric asthma require objective measures of environmental exposure metrics correlated with physiological responses. Here we report a cloud-based wearable IoT sensor system which can measure an asthma patient's exposure to aldehydes, a known class of airway irritants, in real-life settings. The wrist-watch form sensor measures formaldehyde levels in air using fuel cell technology, and continuously operate over 7 days without recharging. Sensor data can be retrieved via Bluetooth Low Energy (BLE) communication. A smartphone app was developed as a gateway to transmit data to an informatics system deployed on Amazon Web Services (AWS) for data storage, management and analytics. Potential applications of this IoT sensor system include epidemiological studies of asthma development and exacerbation, personalized asthma management and environmental monitoring.
△ Less
Submitted 18 November, 2018;
originally announced November 2018.
-
Collective cell migration without proliferation: density determines cell velocity and wave velocity
Authors:
S. Tlili,
E. Gauquelin,
B. Li,
O. Cardoso,
B. Ladoux,
H. Delanoë-Ayari,
F. Graner
Abstract:
Collective cell migration contributes to embryogenesis, wound healing and tumor metastasis. Cell monolayer migration experiments help understanding what determines the movement of cells far from the leading edge. Inhibiting cell proliferation limits cell density increase and prevents jamming; we observe long-duration migration and quantify space-time characteristics of the velocity profile over la…
▽ More
Collective cell migration contributes to embryogenesis, wound healing and tumor metastasis. Cell monolayer migration experiments help understanding what determines the movement of cells far from the leading edge. Inhibiting cell proliferation limits cell density increase and prevents jamming; we observe long-duration migration and quantify space-time characteristics of the velocity profile over large length- and time-scales. Velocity waves propagate backwards and their frequency depends only on cell density at the moving front. Both cell average velocity and wave velocity increase linearly with the cell effective radius regardless of the distance to the front. Inhibiting lamellipodia decreases cell velocity while waves either disappear or have a lower frequency. Our model combines conservation laws, monolayer mechanical properties and a phenomenological coupling between strain and polarity: advancing cells pull on their followers which then become polarized. With reasonable values of parameters, this model agrees with several of our experimental observations. Together, our experiments and model disantangle the respective contributions of active velocity and of proliferation in monolayer migration, explain how cells maintain their polarity far from the moving front, and highlight the importance of strain-polarity coupling and density in long-range information propagation.
△ Less
Submitted 23 March, 2018; v1 submitted 17 October, 2016;
originally announced October 2016.
-
Crawling and turning in a minimal reaction-diffusion cell motility model: coupling cell shape and biochemistry
Authors:
Brian A. Camley,
Yanxiang Zhao,
Bo Li,
Herbert Levine,
Wouter-Jan Rappel
Abstract:
We study a minimal model of a crawling eukaryotic cell with a chemical polarity controlled by a reaction-diffusion mechanism describing Rho GTPase dynamics. The size, shape, and speed of the cell emerge from the combination of the chemical polarity, which controls the locations where actin polymerization occurs, and the physical properties of the cell, including its membrane tension. We find in ou…
▽ More
We study a minimal model of a crawling eukaryotic cell with a chemical polarity controlled by a reaction-diffusion mechanism describing Rho GTPase dynamics. The size, shape, and speed of the cell emerge from the combination of the chemical polarity, which controls the locations where actin polymerization occurs, and the physical properties of the cell, including its membrane tension. We find in our model both highly persistent trajectories, in which the cell crawls in a straight line, and turning trajectories, where the cell transitions from crawling in a line to crawling in a circle. We discuss the controlling variables for this turning instability, and argue that turning arises from a coupling between the reaction-diffusion mechanism and the shape of the cell. This emphasizes the surprising features that can arise from simple links between cell mechanics and biochemistry. Our results suggest that similar instabilities may be present in a broad class of biochemical descriptions of cell polarity.
△ Less
Submitted 6 September, 2016;
originally announced September 2016.
-
An expanded evaluation of protein function prediction methods shows an improvement in accuracy
Authors:
Yuxiang Jiang,
Tal Ronnen Oron,
Wyatt T Clark,
Asma R Bankapur,
Daniel D'Andrea,
Rosalba Lepore,
Christopher S Funk,
Indika Kahanda,
Karin M Verspoor,
Asa Ben-Hur,
Emily Koo,
Duncan Penfold-Brown,
Dennis Shasha,
Noah Youngs,
Richard Bonneau,
Alexandra Lin,
Sayed ME Sahraeian,
Pier Luigi Martelli,
Giuseppe Profiti,
Rita Casadio,
Renzhi Cao,
Zhaolong Zhong,
Jianlin Cheng,
Adrian Altenhoff,
Nives Skunca
, et al. (122 additional authors not shown)
Abstract:
Background: The increasing volume and variety of genotypic and phenotypic data is a major defining characteristic of modern biomedical sciences. At the same time, the limitations in technology for generating data and the inherently stochastic nature of biomolecular events have led to the discrepancy between the volume of data and the amount of knowledge gleaned from it. A major bottleneck in our a…
▽ More
Background: The increasing volume and variety of genotypic and phenotypic data is a major defining characteristic of modern biomedical sciences. At the same time, the limitations in technology for generating data and the inherently stochastic nature of biomolecular events have led to the discrepancy between the volume of data and the amount of knowledge gleaned from it. A major bottleneck in our ability to understand the molecular underpinnings of life is the assignment of function to biological macromolecules, especially proteins. While molecular experiments provide the most reliable annotation of proteins, their relatively low throughput and restricted purview have led to an increasing role for computational function prediction. However, accurately assessing methods for protein function prediction and tracking progress in the field remain challenging. Methodology: We have conducted the second Critical Assessment of Functional Annotation (CAFA), a timed challenge to assess computational methods that automatically assign protein function. One hundred twenty-six methods from 56 research groups were evaluated for their ability to predict biological functions using the Gene Ontology and gene-disease associations using the Human Phenotype Ontology on a set of 3,681 proteins from 18 species. CAFA2 featured significantly expanded analysis compared with CAFA1, with regards to data set size, variety, and assessment metrics. To review progress in the field, the analysis also compared the best methods participating in CAFA1 to those of CAFA2. Conclusions: The top performing methods in CAFA2 outperformed the best methods from CAFA1, demonstrating that computational function prediction is improving. This increased accuracy can be attributed to the combined effect of the growing number of experimental annotations and improved methods for function prediction.
△ Less
Submitted 2 January, 2016;
originally announced January 2016.
-
A Much better replacement of the Michaelis-Menten equation and its application
Authors:
Banghe Li,
Bo Li,
Yuefeng Shen
Abstract:
Michaelis-Menten equation is a basic equation of enzyme kinetics and gives an acceptable approximation of real chemical reaction processes. Analyzing the derivation of this equation yields the fact that its good performance of approximating real reaction processes is due to Michaelis-Menten curve (15). This curve is derived from Quasi-Steady-State Assumption(QSSA), which has been proved always tru…
▽ More
Michaelis-Menten equation is a basic equation of enzyme kinetics and gives an acceptable approximation of real chemical reaction processes. Analyzing the derivation of this equation yields the fact that its good performance of approximating real reaction processes is due to Michaelis-Menten curve (15). This curve is derived from Quasi-Steady-State Assumption(QSSA), which has been proved always true and called Quasi-Steady-State Law by Banghe Li et al [19].
Here, we found a quartic equation A(S,E)=0 (22), which gives more accurate approximation of the reaction process in two aspects: during the quasi-steady state of a reaction, Michaelis-Menten curve approximates the reaction well, while our quartic equation $A(S,E)=0$ gives better approximation; near the end of the reaction, our equation approaches the end of the reaction with a tangent line same to that of the reaction, while Michaelis-Menten curve does not. In addition, our quartic equation A(S,E)=0 differs to Michaelis-Menten curve less than the order of $1/S^3$ as S approaches $+\infty$.
By considering the above merits of A(S,E)=0, we suggest it as a replacement of Michaelis-Menten curve. Intuitively, this new equation is more complex and harder to understand. But, just because its complexity, it provides more information about the rate constants than Michaelis-Menten curve does.
Finally, we get a better replacement of the Michaelis-Menten equation by combing A(S,E)=0 and the equation $dP/dt=k_2C(t)$.
△ Less
Submitted 14 May, 2015;
originally announced May 2015.
-
The Increase of the Functional Entropy of the Human Brain with Age
Authors:
Y. Yao,
W. L. Lu,
B. Xu,
C. B. Li,
C. P. Lin,
D. Waxman,
J. F. Feng
Abstract:
We use entropy to characterize intrinsic ageing properties of the human brain. Analysis of fMRI data from a large dataset of individuals, using resting state BOLD signals, demonstrated that a functional entropy associated with brain activity increases with age. During an average lifespan, the entropy, which was calculated from a population of individuals, increased by approximately 0.1 bits, due t…
▽ More
We use entropy to characterize intrinsic ageing properties of the human brain. Analysis of fMRI data from a large dataset of individuals, using resting state BOLD signals, demonstrated that a functional entropy associated with brain activity increases with age. During an average lifespan, the entropy, which was calculated from a population of individuals, increased by approximately 0.1 bits, due to correlations in BOLD activity becoming more widely distributed. We attribute this to the number of excitatory neurons and the excitatory conductance decreasing with age. Incorporating these properties into a computational model leads to quantitatively similar results to the fMRI data. Our dataset involved males and females and we found significant differences between them. The entropy of males at birth was lower than that of females. However, the entropies of the two sexes increase at different rates, and intersect at approximately 50 years; after this age, males have a larger entropy.
△ Less
Submitted 8 June, 2014;
originally announced June 2014.
-
A Smartphone Controlled Handheld Microfluidic Liquid Handling System
Authors:
Baichen Li,
Lin Li,
Allan Guan,
Quan Dong,
Kangcheng Ruan,
Ronggui Hu,
Zhenyu Li
Abstract:
Microfluidics and lab-on-a-chip technologies have made it possible to manipulate small volume liquids with unprecedented resolution, automation and integration. However, most current microfluidic systems still rely on bulky off-chip infrastructures such as compressed pressure sources, syringe pumps and computers to achieve complex liquid manipulation functions. Here, we present a handheld automate…
▽ More
Microfluidics and lab-on-a-chip technologies have made it possible to manipulate small volume liquids with unprecedented resolution, automation and integration. However, most current microfluidic systems still rely on bulky off-chip infrastructures such as compressed pressure sources, syringe pumps and computers to achieve complex liquid manipulation functions. Here, we present a handheld automated microfluidic liquid handling system controlled by a smartphone, which is enabled by combining elastomeric on-chip valves and a compact pneumatic system. As a demonstration, we show that the system can automatically perform all the liquid handling steps of a bead-based sandwich immunoassay on a multi-layer PDMS chip without any human intervention. The footprint of the system is 6 by 10.5 by 16.5cm, and the total weight is 829g including battery. Powered by a 12.8V 1500mAh Li battery, the system consumed 2.2W on average during the immunoassay and lasted for 8.7 hrs. This handheld microfluidic liquid handling platform is generally applicable to many biochemical and cell-based assays requiring complex liquid manipulation and sample preparation steps such as FISH, PCR, flow cytometry and nucleic acid sequencing. In particular, the integration of this technology with read-out biosensors may help enable the realization of the long-sought Tricorder-like handheld in-vitro diagnostic (IVD) systems.
△ Less
Submitted 20 May, 2014;
originally announced May 2014.
-
Periodic migration in a physical model of cells on micropatterns
Authors:
Brian A. Camley,
Yanxiang Zhao,
Bo Li,
Herbert Levine,
Wouter-Jan Rappel
Abstract:
We extend a model for the morphology and dynamics of a crawling eukaryotic cell to describe cells on micropatterned substrates. This model couples cell morphology, adhesion, and cytoskeletal flow in response to active stresses induced by actin and myosin. We propose that protrusive stresses are only generated where the cell adheres, leading to the cell's effective confinement to the pattern. Consi…
▽ More
We extend a model for the morphology and dynamics of a crawling eukaryotic cell to describe cells on micropatterned substrates. This model couples cell morphology, adhesion, and cytoskeletal flow in response to active stresses induced by actin and myosin. We propose that protrusive stresses are only generated where the cell adheres, leading to the cell's effective confinement to the pattern. Consistent with experimental results, simulated cells exhibit a broad range of behaviors, including steady motion, turning, bipedal motion, and periodic migration, in which the cell crawls persistently in one direction before reversing periodically. We show that periodic motion emerges naturally from the coupling of cell polarization to cell shape by reducing the model to a simplified one-dimensional form that can be understood analytically.
△ Less
Submitted 30 September, 2013;
originally announced October 2013.
-
Evaluating strategies of phylogenetic analyses by the coherence of their results
Authors:
Blaise Li
Abstract:
I propose an approach to identify, among several strategies of phylogenetic analysis, those producing the most accurate results. This approach is based on the hypothesis that the more a result is reproduced from independent data, the more it reflects the historical signal common to the analysed data. Under this hypothesis, the capacity of an analytical strategy to extract historical signal should…
▽ More
I propose an approach to identify, among several strategies of phylogenetic analysis, those producing the most accurate results. This approach is based on the hypothesis that the more a result is reproduced from independent data, the more it reflects the historical signal common to the analysed data. Under this hypothesis, the capacity of an analytical strategy to extract historical signal should correlate positively with the coherence of the obtained results. I apply this approach to a series of analyses on empirical data, basing the coherence measure on the Robinson-Foulds distances between the obtained trees. At first approximation, the analytical strategies most suitable for the data produce the most coherent results. However, risks of false positives and false negatives are identified, which are difficult to rule out.
△ Less
Submitted 5 July, 2013;
originally announced July 2013.
-
Why does air passage over forest yield more rain? Examining the coupling between rainfall, pressure and atmospheric moisture content
Authors:
Anastassia M. Makarieva,
Victor G. Gorshkov,
Douglas Sheil,
Antonio D. Nobre,
Peter Bunyard,
Bai-Lian Li
Abstract:
The influence of forest loss on rainfall remains poorly understood. Addressing this challenge Spracklen et al. recently presented a pan-tropical study of rainfall and land-cover that showed that satellite-derived rainfall measures were positively correlated with the degree to which model-derived air trajectories had been exposed to forest cover. This result confirms the influence of vegetation on…
▽ More
The influence of forest loss on rainfall remains poorly understood. Addressing this challenge Spracklen et al. recently presented a pan-tropical study of rainfall and land-cover that showed that satellite-derived rainfall measures were positively correlated with the degree to which model-derived air trajectories had been exposed to forest cover. This result confirms the influence of vegetation on regional rainfall patterns suggested in previous studies. However, we find that the conclusion of Spracklen et al. -- that differences in rainfall reflect air moisture content resulting from evapotranspiration while the circulation pattern remains unchanged -- appears undermined by methodological inconsistencies. We identify methodological problems with the underlying analyses and the quantitative estimates for rainfall change predicted if forest cover is lost in the Amazon. We discuss some alternative explanations that include the distinct role of forest evapotranspiration in creating low pressure systems that draw moisture from the oceans to the continental hinterland. Our analysis of meteorological data from three regions in Brazil, including the central Amazon forest, reveal a tendency for rainy days during the wet season with column water vapor (CWV) exceeding 50 mm to have higher pressure than rainless days; while at lower CWV rainy days tend to have lower pressure than rainless days. The coupling between atmospheric moisture content and circulation dynamics underlines that the danger posed by forest loss is greater than suggested by focusing only on moisture recycling alone.
△ Less
Submitted 17 April, 2013; v1 submitted 14 January, 2013;
originally announced January 2013.
-
Spectral analysis of Gene co-expression network of Zebrafish
Authors:
S. Jalan,
C. Y. Ung,
J. Bhojwani,
B. Li,
L. Zhang,
S. H. Lan,
Z. Gong
Abstract:
We analyze the gene expression data of Zebrafish under the combined framework of complex networks and random matrix theory. The nearest neighbor spacing distribution of the corresponding matrix spectra follows random matrix predictions of Gaussian orthogonal statistics. Based on the eigenvector analysis we can divide the spectra into two parts, first part for which the eigenvector localization pro…
▽ More
We analyze the gene expression data of Zebrafish under the combined framework of complex networks and random matrix theory. The nearest neighbor spacing distribution of the corresponding matrix spectra follows random matrix predictions of Gaussian orthogonal statistics. Based on the eigenvector analysis we can divide the spectra into two parts, first part for which the eigenvector localization properties match with the random matrix theory predictions, and the second part for which they show deviation from the theory and hence are useful to understand the system dependent properties. Spectra with the localized eigenvectors can be characterized into three groups based on the eigenvalues. We explore the position of localized nodes from these different categories. Using an overlap measure, we find that the top contributing nodes in the different groups carry distinguished structural features. Furthermore, the top contributing nodes of the different localized eigenvectors corresponding to the lower eigenvalue regime form different densely connected structure well separated from each other. Preliminary biological interpretation of the genes, associated with the top contributing nodes in the localized eigenvectors, suggests that the genes corresponding to same vector share common features.
△ Less
Submitted 23 August, 2012;
originally announced August 2012.
-
Skeletal Rigidity of Phylogenetic Trees
Authors:
Howard Cheng,
Satyan L. Devadoss,
Brian Li,
Andrej Risteski
Abstract:
Motivated by geometric origami and the straight skeleton construction, we outline a map between spaces of phylogenetic trees and spaces of planar polygons. The limitations of this map is studied through explicit examples, culminating in proving a structural rigidity result.
Motivated by geometric origami and the straight skeleton construction, we outline a map between spaces of phylogenetic trees and spaces of planar polygons. The limitations of this map is studied through explicit examples, culminating in proving a structural rigidity result.
△ Less
Submitted 26 March, 2012;
originally announced March 2012.
-
Reverse engineering of complex dynamical networks in the presence of time-delayed interactions based on noisy time series
Authors:
Wen-Xu Wang,
Jie Ren,
Ying-Cheng Lai,
Baowen Li
Abstract:
Reverse engineering of complex dynamical networks is important for a variety of fields where uncovering the full topology of unknown networks and estimating parameters characterizing the network structure and dynamical processes are of interest. We consider complex oscillator networks with time-delayed interactions in a noisy environment, and develop an effective method to infer the full topology…
▽ More
Reverse engineering of complex dynamical networks is important for a variety of fields where uncovering the full topology of unknown networks and estimating parameters characterizing the network structure and dynamical processes are of interest. We consider complex oscillator networks with time-delayed interactions in a noisy environment, and develop an effective method to infer the full topology of the network and evaluate the amount of time delay based solely on noise- contaminated time series. In particular, we develop an analytic theory establishing that the dynamical correlation matrix, which can be constructed purely from time series, can be manipulated to yield both the network topology and the amount of time delay simultaneously. Extensive numerical support is provided to validate the method. While our method provides a viable solution to the network inverse problem, significant difficulties, limitations, and challenges still remain, and these are discussed thoroughly.
△ Less
Submitted 18 December, 2012; v1 submitted 31 January, 2011;
originally announced January 2011.
-
An exploratory analysis of combined genome-wide SNP data from several recent studies
Authors:
Blaise Li
Abstract:
The usefulness of a `total-evidence' approach to human population genetics was assessed through a clustering analysis of combined genome-wide SNP datasets. The combination contained only 3146 SNPs. Detailed examination of the results nonetheless enables the extraction of relevant clues about the history of human populations, some pertaining to events as ancient as the first migration out of Africa…
▽ More
The usefulness of a `total-evidence' approach to human population genetics was assessed through a clustering analysis of combined genome-wide SNP datasets. The combination contained only 3146 SNPs. Detailed examination of the results nonetheless enables the extraction of relevant clues about the history of human populations, some pertaining to events as ancient as the first migration out of Africa. The results are mostly coherent with what is known from history, linguistics, and previous genetic analyses. These promising results suggest that cross-studies data confrontation have the potential to yield interesting new hypotheses about human population history.
△ Less
Submitted 6 December, 2012; v1 submitted 28 January, 2011;
originally announced January 2011.
-
Understanding and predicting synthetic lethal genetic interactions in Saccharomyces cerevisiae using domain genetic interactions
Authors:
Bo Li,
Weiguo Cao,
Jizhong Zhou,
Feng Luo
Abstract:
Genetic interactions have been widely used to define functional relationships between proteins and pathways. In this study, we demonstrated that yeast synthetic lethal genetic interactions can be explained by the genetic interactions between domains of those proteins. The domain genetic interactions rarely overlap with the domain physical interactions from iPfam database and provide a complementar…
▽ More
Genetic interactions have been widely used to define functional relationships between proteins and pathways. In this study, we demonstrated that yeast synthetic lethal genetic interactions can be explained by the genetic interactions between domains of those proteins. The domain genetic interactions rarely overlap with the domain physical interactions from iPfam database and provide a complementary view about domain relationships. Moreover, we found that domains in multidomain yeast proteins contribute to their genetic interactions differently. The domain genetic interactions help more precisely define the function related to the synthetic lethal genetic interactions, and then help understand how domains contribute to different functionalities of multidomain proteins. Using the probabilities of domain genetic interactions, we were able to predict novel yeast synthetic lethal genetic interactions. Furthermore, we had also identified novel compensatory pathways from the predicted synthetic lethal genetic interactions. Our study significantly improved the understanding of yeast mulitdomain proteins, the synthetic lethal genetic interactions and the functional relationships between proteins and pathways.
△ Less
Submitted 22 April, 2011; v1 submitted 6 January, 2011;
originally announced January 2011.
-
Spectral Properties of Directed Random Networks with Modular Structure
Authors:
Sarika Jalan,
Guimei Zhu,
Baowen Li
Abstract:
We study spectra of directed networks with inhibitory and excitatory couplings. We investigate in particular eigenvector localization properties of various model networks for different value of correlation among their entries. Spectra of random networks, with completely uncorrelated entries show a circular distribution with delocalized eigenvectors, where as networks with correlated entries have l…
▽ More
We study spectra of directed networks with inhibitory and excitatory couplings. We investigate in particular eigenvector localization properties of various model networks for different value of correlation among their entries. Spectra of random networks, with completely uncorrelated entries show a circular distribution with delocalized eigenvectors, where as networks with correlated entries have localized eigenvectors. In order to understand the origin of localization we track the spectra as a function of connection probability and directionality. As connections are made directed, eigenstates start occurring in complex conjugate pairs and the eigenvalue distribution combined with the localization measure shows a rich pattern. Moreover, for a very well distinguished community structure, the whole spectrum is localized except few eigenstates at boundary of the circular distribution. As the network deviates from the community structure there is a sudden change in the localization property for a very small value of deformation from the perfect community structure. We search for this effect for the whole range of correlation strengths and for different community configurations. Furthermore, we investigate spectral properties of a metabolic network of zebrafish, and compare them with those of the model networks.
△ Less
Submitted 18 October, 2011; v1 submitted 31 December, 2010;
originally announced January 2011.
-
Random matrix analysis of localization properties of Gene co-expression network
Authors:
Sarika Jalan,
Norbert Solymosi,
Gabör Vattay,
Baowen Li
Abstract:
We analyze gene co-expression network under the random matrix theory framework. The nearest neighbor spacing distribution of the adjacency matrix of this network follows Gaussian orthogonal statistics of random matrix theory (RMT). Spectral rigidity test follows random matrix prediction for a certain range, and deviates after wards. Eigenvector analysis of the network using inverse participation…
▽ More
We analyze gene co-expression network under the random matrix theory framework. The nearest neighbor spacing distribution of the adjacency matrix of this network follows Gaussian orthogonal statistics of random matrix theory (RMT). Spectral rigidity test follows random matrix prediction for a certain range, and deviates after wards. Eigenvector analysis of the network using inverse participation ratio (IPR) suggests that the statistics of bulk of the eigenvalues of network is consistent with those of the real symmetric random matrix, whereas few eigenvalues are localized. Based on these IPR calculations, we can divide eigenvalues in three sets; (A) The non-degenerate part that follows RMT. (B) The non-degenerate part, at both ends and at intermediate eigenvalues, which deviate from RMT and expected to contain information about {\it important nodes} in the network. (C) The degenerate part with $zero$ eigenvalue, which fluctuates around RMT predicted value. We identify nodes corresponding to the dominant modes of the corresponding eigenvectors and analyze their structural properties.
△ Less
Submitted 9 April, 2010; v1 submitted 27 January, 2010;
originally announced January 2010.
-
Effectively integrating information content and structural relationship to improve the GO-based similarity measure between proteins
Authors:
Bo Li,
James Z. Wang,
F. Alex Feltus,
Jizhong Zhou,
Feng Luo
Abstract:
The Gene Ontology (GO) provides a knowledge base to effectively describe proteins. However, measuring similarity between proteins based on GO remains a challenge. In this paper, we propose a new similarity measure, information coefficient similarity measure (SimIC), to effectively integrate both the information content (IC) of GO terms and the structural information of GO hierarchy to determine…
▽ More
The Gene Ontology (GO) provides a knowledge base to effectively describe proteins. However, measuring similarity between proteins based on GO remains a challenge. In this paper, we propose a new similarity measure, information coefficient similarity measure (SimIC), to effectively integrate both the information content (IC) of GO terms and the structural information of GO hierarchy to determine the similarity between proteins. Testing on yeast proteins, our results show that SimIC efficiently addresses the shallow annotation issue in GO, thus improves the correlations between GO similarities of yeast proteins and their expression similarities as well as between GO similarities of yeast proteins and their sequence similarities. Furthermore, we demonstrate that the proposed SimIC is superior in predicting yeast protein interactions. We predict 20484 yeast protein-protein interactions (PPIs) between 2462 proteins based on the high SimIC values of biological process (BP) and cellular component (CC). Examining the 214 MIPS complexes in our predicted PPIs shows that all members of 159 MIPS complexes can be found in our PPI predictions, which is more than those (120/214) found in PPIs predicted by relative specificity similarity (RSS). Integrating IC and structural information of GO hierarchy can improve the effectiveness of the semantic similarity measure of GO terms. The new SimIC can effectively correct the effect of shallow annotation, and then provide an effective way to measure similarity between proteins based on Gene Ontology.
△ Less
Submitted 6 January, 2010; v1 submitted 6 January, 2010;
originally announced January 2010.
-
Thermodynamic stability of small-world oscillator networks: A case study of proteins
Authors:
Jie Ren,
Baowen Li
Abstract:
We study vibrational thermodynamic stability of small-world oscillator networks, by relating the average mean-square displacement $S$ of oscillators to the eigenvalue spectrum of the Laplacian matrix of networks. We show that the cross-links suppress $S$ effectively and there exist two phases on the small-world networks: 1) an unstable phase: when $p\ll1/N$, $S\sim N$; 2) a stable phase: when…
▽ More
We study vibrational thermodynamic stability of small-world oscillator networks, by relating the average mean-square displacement $S$ of oscillators to the eigenvalue spectrum of the Laplacian matrix of networks. We show that the cross-links suppress $S$ effectively and there exist two phases on the small-world networks: 1) an unstable phase: when $p\ll1/N$, $S\sim N$; 2) a stable phase: when $p\gg1/N$, $S\sim p^{-1}$, \emph{i.e.}, $S/N\sim E_{cr}^{-1}$. Here, $p$ is the parameter of small-world, $N$ is the number of oscillators, and $E_{cr}=pN$ is the number of cross-links. The results are exemplified by various real protein structures that follow the same scaling behavior $S/N\sim E_{cr}^{-1}$ of the stable phase. We also show that it is the "small-world" property that plays the key role in the thermodynamic stability and is responsible for the universal scaling $S/N\sim E_{cr}^{-1}$, regardless of the model details.
△ Less
Submitted 7 May, 2009;
originally announced May 2009.