Search | arXiv e-print repository

DeepLINK-T: deep learning inference for time series data using knockoffs and LSTM

Authors: Wenxuan Zuo, Zifan Zhu, Yuxuan Du, Yi-Chun Yeh, Jed A. Fuhrman, Jinchi Lv, Yingying Fan, Fengzhu Sun

Abstract: High-dimensional longitudinal time series data is prevalent across various real-world applications. Many such applications can be modeled as regression problems with high-dimensional time series covariates. Deep learning has been a popular and powerful tool for fitting these regression models. Yet, the development of interpretable and reproducible deep-learning models is challenging and remains underexplored. This study introduces a novel method, Deep Learning Inference using Knockoffs for Time series data (DeepLINK-T), focusing on the selection of significant time series variables in regression while controlling the false discovery rate (FDR) at a predetermined level. DeepLINK-T combines deep learning with knockoff inference to control FDR in feature selection for time series models, accommodating a wide variety of feature distributions. It addresses dependencies across time and features by leveraging a time-varying latent factor structure in time series covariates. Three key ingredients for DeepLINK-T are 1) a Long Short-Term Memory (LSTM) autoencoder for generating time series knockoff variables, 2) an LSTM prediction network using both original and knockoff variables, and 3) the application of the knockoffs framework for variable selection with FDR control. Extensive simulation studies have been conducted to evaluate DeepLINK-T's performance, showing its capability to control FDR effectively while demonstrating superior feature selection power for high-dimensional longitudinal time series data compared to its non-time series counterpart. DeepLINK-T is further applied to three metagenomic data sets, validating its practical utility and effectiveness, and underscoring its potential in real-world applications.

Submitted 5 April, 2024; originally announced April 2024.
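
The third ingredient named in the abstract above, variable selection with FDR control, can be illustrated with the standard knockoff+ thresholding rule. This is a minimal sketch, not the authors' implementation: the abstract does not say how the importance statistics are computed or which variant of the rule is used, so the input `w` (one statistic per feature, positive when the original variable looks more important than its knockoff) and the function names are assumptions made for illustration.

```python
def knockoff_plus_threshold(w, q):
    """Smallest t whose estimated false discovery proportion is at most q.

    w: hypothetical knockoff statistics, one per feature.
    q: target FDR level, e.g. 0.2.
    """
    # Candidate thresholds are the distinct nonzero magnitudes of w.
    candidates = sorted({abs(x) for x in w if x != 0})
    for t in candidates:
        # Large negative statistics estimate the number of false selections.
        false_est = 1 + sum(1 for x in w if x <= -t)
        selected = max(1, sum(1 for x in w if x >= t))
        if false_est / selected <= q:
            return t
    return float("inf")  # no threshold meets the target level


def knockoff_select(w, q):
    """Indices of features whose statistic reaches the threshold."""
    t = knockoff_plus_threshold(w, q)
    return [j for j, x in enumerate(w) if x >= t]


if __name__ == "__main__":
    w = [5.0, 4.0, 3.0, 2.5, -0.5, 0.3, 2.0, -0.2, 1.5, 0.1]
    print(knockoff_select(w, q=0.2))
```

The "+1" in the numerator makes the rule conservative: with few features and a strict target level it may select nothing at all.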

arXiv:2401.01367 [pdf]

Guidelines in Wastewater-based Epidemiology of SARS-CoV-2 with Diagnosis

Authors: Madiha Fatima, Zhihua Cao, Aichun Huang, Shengyuan Wu, Xinxian Fan, Yi Wang, Liu Jiren, Ziyun Zhu, Qiongrou Ye, Yuan Ma, Joseph K. F Chow, Peng Jia, Yangshou Liu, Yubin Lin, Manjun Ye, Tong Wu, Zhixun Li, Cong Cai, Wenhai Zhang, Cheris H. Q. Ding, Yuanzhe Cai, Feijuan Huang

Abstract: With the global spread and increasing transmission rate of SARS-CoV-2, more and more laboratories and researchers are turning their attention to wastewater-based epidemiology (WBE), hoping it can become an effective tool for large-scale testing and provide more accurate predictions of the number of infected individuals. Based on the cases of sewage sampling and testing in some regions such as Hong Kong, Brazil, and the United States, the feasibility of detecting the novel coronavirus in sewage is extremely high. This study reviews domestic and international achievements in detecting SARS-CoV-2 through WBE and summarizes four aspects: sampling methods, virus decay rate calculation, standardized population coverage of the watershed, and algorithm prediction. It also provides ideas for combining field modeling with epidemic prevention and control. Moreover, we highlight some diagnostic techniques for detecting the virus in sewage samples. Our review offers a new approach to identifying research gaps in wastewater-based epidemiology and diagnosis, and we also discuss the future prospects of our analysis.

Submitted 26 December, 2023; originally announced January 2024.

arXiv:2401.00746 [pdf, other]

Learn to integrate parts for whole through correlated neural variability

Authors: Zhichao Zhu, Yang Qi, Wenlian Lu, Jianfeng Feng

Abstract: Sensory perception originates from the responses of sensory neurons, which react to a collection of sensory signals linked to various physical attributes of a singular perceptual object. Unraveling how the brain extracts perceptual information from these neuronal responses is a pivotal challenge in both computational neuroscience and machine learning. Here we introduce a statistical mechanical theory, where perceptual information is first encoded in the correlated variability of sensory neurons and then reformatted into the firing rates of downstream neurons. Applying this theory, we illustrate the encoding of motion direction using neural covariance and demonstrate high-fidelity direction recovery by spiking neural networks. Networks trained under this theory also show enhanced performance in classifying natural images, achieving higher accuracy and faster inference speed. Our results challenge the traditional view of neural covariance as a secondary factor in neural coding, highlighting its potential influence on brain function.

Submitted 1 January, 2024; originally announced January 2024.

Comments: 18 pages, 5 figures

arXiv:2308.04478 [pdf]

EasyMergeR: an interactive Shiny application to manipulate multiple XLSX files of multiple sheets

Authors: Ziyu Zhu, Ximing Xu

Abstract: The integration of sequencing data with clinical information is a widely accepted strategy in bioinformatics and health informatics. Despite advanced databases and sophisticated tools for processing omics data, challenges remain in handling the raw clinical data (typically in XLSX format with multiple sheets inside), either exported from health information system (HIS) or manually collected by investigators. This is particularly difficult for time-constrained medical staff with little or no programming background, and it is typically the first bottleneck in many clinical-oriented studies. To fill this gap, we developed EasyMergeR, a simple, user-friendly, code-free R Shiny application that allows interactive manipulation of multiple XLSX files with multiple sheets and provides basic data manipulation capabilities based on the tidyverse and other handy R packages.

Submitted 8 August, 2023; originally announced August 2023.

Comments: 6 pages, 1 figure

arXiv:2305.13982 [pdf, other]

Toward stochastic neural computing

Authors: Yang Qi, Zhichao Zhu, Yiming Wei, Lu Cao, Zhigang Wang, Jie Zhang, Wenlian Lu, Jianfeng Feng

Abstract: The highly irregular spiking activity of cortical neurons and behavioral variability suggest that the brain could operate in a fundamentally probabilistic way. Mimicking how the brain implements and learns probabilistic computation could be a key to developing machine intelligence that can think more like humans. In this work, we propose a theory of stochastic neural computing (SNC) in which streams of noisy inputs are transformed and processed through populations of nonlinearly coupled spiking neurons. To account for the propagation of correlated neural variability, we derive from first principles a moment embedding for spiking neural network (SNN). This leads to a new class of deep learning model called the moment neural network (MNN) which naturally generalizes rate-based neural networks to second order. As the MNN faithfully captures the stationary statistics of spiking neural activity, it can serve as a powerful proxy for training SNN with zero free parameters. Through joint manipulation of mean firing rate and noise correlations in a task-driven way, the model is able to learn inference tasks while simultaneously minimizing prediction uncertainty, resulting in enhanced inference speed. We further demonstrate the application of our method to Intel's Loihi neuromorphic hardware. The proposed theory of SNC may open up new opportunities for developing machine intelligence capable of computing uncertainty and for designing unconventional computing architectures.

Submitted 21 April, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

arXiv:2205.15364 [pdf, other]

doi 10.1049/cit2.12194

Associative Learning Mechanism for Drug-Target Interaction Prediction

Authors: Zhiqin Zhu, Zheng Yao, Guanqiu Qi, Neal Mazur, Baisen Cong

Abstract: As a necessary process in drug development, finding a drug compound that can selectively bind to a specific protein is highly challenging and costly. Drug-target affinity (DTA), which represents the strength of drug-target interaction (DTI), has played an important role in the DTI prediction task over the past decade. Although deep learning has been applied to DTA-related research, existing solutions ignore fundamental correlations between molecular substructures in molecular representation learning of drug compound molecules/protein targets. Moreover, traditional methods lack the interpretability of the DTA prediction process. This results in missing feature information of intermolecular interactions, thereby affecting prediction performance. Therefore, this paper proposes a DTA prediction method with interactive learning and an autoencoder mechanism. The proposed model enhances the corresponding ability to capture the feature information of a single molecular sequence by the drug/protein molecular representation learning module and supplements the information interaction between molecular sequence pairs by the interactive information learning module. The DTA value prediction module fuses the drug-target pair interaction information to output the predicted value of DTA. Additionally, this paper theoretically proves that the proposed method maximizes evidence lower bound (ELBO) for the joint distribution of the DTA prediction model, which enhances the consistency of the probability distribution between the actual value and the predicted value. The experimental results confirm mutual transformer-drug target affinity (MT-DTA) achieves better performance than other comparative methods.

Submitted 15 December, 2023; v1 submitted 24 May, 2022; originally announced May 2022.

Comments: The extended and final version of this paper has been published with open access modality in the CAAI Transactions on Intelligence Technology and can be found at link LINK HERE. Please refer to the TRIT published version in your scientific papers

Journal ref: Zhiqin Zhu (2023) 1-20

arXiv:2202.00087 [pdf, other]

Holistic Fine-grained GGS Characterization: From Detection to Unbalanced Classification

Authors: Yuzhe Lu, Haichun Yang, Zuhayr Asad, Zheyu Zhu, Tianyuan Yao, Jiachen Xu, Agnes B. Fogo, Yuankai Huo

Abstract: Recent studies have demonstrated the diagnostic and prognostic values of global glomerulosclerosis (GGS) in IgA nephropathy, aging, and end-stage renal disease. However, the fine-grained quantitative analysis of multiple GGS subtypes (e.g., obsolescent, solidified, and disappearing glomerulosclerosis) is typically a resource-intensive manual process. Very few automatic methods, if any, have been developed to bridge this gap for such analytics. In this paper, we present a holistic pipeline to quantify GGS (with both detection and classification) from a whole slide image in a fully automatic manner. In addition, we conduct the fine-grained classification for the subtypes of GGS. Our study releases the open-source quantitative analytical tool for fine-grained GGS characterization while tackling the technical challenges in unbalanced classification and integrating detection and classification.

Submitted 31 January, 2022; originally announced February 2022.

arXiv:2102.12040 [pdf, other]

doi 10.1093/bioinformatics/btab123

Active Learning to Classify Macromolecular Structures in situ for Less Supervision in Cryo-Electron Tomography

Authors: Xuefeng Du, Haohan Wang, Zhenxi Zhu, Xiangrui Zeng, Yi-Wei Chang, Jing Zhang, Min Xu

Abstract: Motivation: Cryo-Electron Tomography (cryo-ET) is a 3D bioimaging tool that visualizes the structural and spatial organization of macromolecules at a near-native state in single cells, which has broad applications in life science. However, the systematic structural recognition and recovery of macromolecules captured by cryo-ET are difficult due to high structural complexity and imaging limits. Deep learning-based subtomogram classification methods have played critical roles in such tasks. As supervised approaches, however, their performance relies on sufficient and laborious annotation of a large training dataset. Results: To alleviate this major labeling burden, we proposed a Hybrid Active Learning (HAL) framework for querying subtomograms for labeling from a large unlabeled subtomogram pool. Firstly, HAL adopts uncertainty sampling to select the subtomograms that have the most uncertain predictions. Moreover, to mitigate the sampling bias caused by such a strategy, a discriminator is introduced to judge whether a certain subtomogram is labeled or unlabeled, and subsequently the model queries the subtomograms that have higher probabilities of being unlabeled. Additionally, HAL introduces a subset sampling strategy to improve the diversity of the query set, so that the information overlap between the queried batches is decreased and the algorithmic efficiency is improved. Our experiments on subtomogram classification tasks using both simulated and real data demonstrate that we can achieve comparable testing performance (on average only a 3% accuracy drop) by using less than 30% of the labeled subtomograms, which is a very promising result for subtomogram classification with limited labeling resources.

Submitted 27 July, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

Comments: Statement on authorship changes: Dr. Eric Xing was an academic advisor of Mr. Haohan Wang. Dr. Xing was not directly involved in this work and has no direct interaction or collaboration with any other authors on this work. Therefore, Dr. Xing is removed from the author list according to his request. Mr. Zhenxi Zhu's affiliation is updated to his current affiliation
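
The uncertainty sampling step described in the HAL abstract above can be sketched as follows. The abstract does not state which uncertainty measure is used, so predictive entropy is an assumption here, and `entropy`, `most_uncertain`, and the toy probability pool are hypothetical names and data chosen for illustration. The discriminator and subset sampling components of HAL are not covered.

```python
import math


def entropy(probs):
    """Shannon entropy of one predicted class distribution (natural log)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def most_uncertain(pool_probs, k):
    """Indices of the k unlabeled items with the highest predictive entropy.

    pool_probs: list of per-item class probability lists from a classifier.
    """
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: entropy(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


if __name__ == "__main__":
    # Toy predictions for four unlabeled items over two classes.
    pool = [[0.9, 0.1], [0.5, 0.5], [0.6, 0.4], [1.0, 0.0]]
    print(most_uncertain(pool, k=2))
```

Items whose predicted distribution is closest to uniform rank first, so they are the ones sent for labeling.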

arXiv:2101.07654 [pdf, other]

Improve Global Glomerulosclerosis Classification with Imbalanced Data using CircleMix Augmentation

Authors: Yuzhe Lu, Haichun Yang, Zheyu Zhu, Ruining Deng, Agnes B. Fogo, Yuankai Huo

Abstract: The classification of glomerular lesions is a routine and essential task in renal pathology. Recently, machine learning approaches, especially deep learning algorithms, have been used to perform computer-aided lesion characterization of glomeruli. However, one major challenge of developing such methods is the naturally imbalanced distribution of different lesions. In this paper, we propose CircleMix, a novel data augmentation technique, to improve the accuracy of classifying globally sclerotic glomeruli with a hierarchical learning strategy. Different from the recently proposed CutMix method, the CircleMix augmentation is optimized for the ball-shaped biomedical objects, such as glomeruli. 6,861 glomeruli with five classes (normal, periglomerular fibrosis, obsolescent glomerulosclerosis, solidified glomerulosclerosis, and disappearing glomerulosclerosis) were employed to develop and evaluate the proposed methods. From five-fold cross-validation, the proposed CircleMix augmentation achieved superior performance (Balanced Accuracy=73.0%) compared with the EfficientNet-B0 baseline (Balanced Accuracy=69.4%).

Submitted 16 January, 2021; originally announced January 2021.
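
The abstract above reports balanced accuracy, a metric suited to imbalanced classes. The sketch below assumes the common definition, the mean of per-class recall; the abstract does not spell out the formula, and the function name and toy labels are hypothetical.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall over the classes present in y_true."""
    classes = sorted(set(y_true))
    recalls = []
    for c in classes:
        # Recall for class c: correct predictions among true members of c.
        total = sum(1 for t in y_true if t == c)
        correct = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        recalls.append(correct / total)
    return sum(recalls) / len(recalls)


if __name__ == "__main__":
    # Imbalanced toy labels: four items of class 0, two of class 1.
    y_true = [0, 0, 0, 0, 1, 1]
    y_pred = [0, 0, 0, 1, 1, 0]
    print(balanced_accuracy(y_true, y_pred))
```

Because every class contributes equally to the average, a classifier cannot score well by favoring the majority class.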

arXiv:2008.13561 [pdf]

Sustainable Border Control Policy in the COVID-19 Pandemic: A Math Modeling Study

Authors: Zhen Zhu, Enzo Weber, Till Strohsal, Duaa Serhan

Abstract: Imported COVID-19 cases, if unchecked, can jeopardize the effort of domestic containment. We aim to find out what sustainable border control options for different entities (e.g., countries, states) exist during the reopening phases, given their own choice of domestic control measures and new technologies such as contact tracing. We propose a SUIHR model, which represents an extension to the discrete time SIR models. The model focuses on studying the spreading of virus predominantly by asymptomatic and pre-symptomatic patients. Imported risk and (1-tier) contact tracing are both built into the model. Under plausible parameter assumptions, we seek sustainable border control policies, in combination with sufficient internal measures, which allow entities to confine the virus without the need to revert back to more restrictive life styles or to rely on herd immunity. When the base reproduction number of COVID-19 exceeds 2.5, even 100% effective contact tracing alone is not enough to contain the spreading. For an entity that has completely eliminated the virus domestically, and resumes "normal", very strict pre-departure screening and test and isolation upon arrival combined with effective contact tracing can only delay another outbreak by 6 months. However, if the total net imported cases are non-increasing, and the entity employs a confining domestic control policy, then the total new cases can be contained even without border control.

Submitted 4 February, 2021; v1 submitted 28 August, 2020; originally announced August 2020.

Comments: 10 pages, 3 figures and 1 table. A condensed and modified version. Improved writing, no major results changed
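
The abstract above describes SUIHR as an extension of discrete-time SIR models. The following sketch shows only a plain discrete-time SIR update as background, not the SUIHR model itself: the extra compartments, imported cases, and contact tracing are not represented, and the parameter names `beta` (transmission rate per step) and `gamma` (recovery fraction per step) and all numeric values are assumptions for illustration.

```python
def simulate_sir(s0, i0, r0, beta, gamma, steps):
    """Iterate a basic discrete-time SIR model and return the trajectory.

    Returns a list of (susceptible, infectious, recovered) tuples,
    starting with the initial state.
    """
    n = s0 + i0 + r0
    s, i, r = float(s0), float(i0), float(r0)
    history = [(s, i, r)]
    for _ in range(steps):
        new_infections = min(s, beta * s * i / n)  # cannot exceed susceptibles
        new_recoveries = gamma * i
        s -= new_infections
        i += new_infections - new_recoveries
        r += new_recoveries
        history.append((s, i, r))
    return history


if __name__ == "__main__":
    # beta / gamma = 3 here, so the infectious count grows at first.
    trajectory = simulate_sir(990, 10, 0, beta=0.3, gamma=0.1, steps=50)
    print(trajectory[-1])
```

In this simple model the ratio `beta / gamma` plays the role of the base reproduction number: infections grow early on only when it exceeds 1.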

arXiv:1908.08807 [pdf, other]

An encoding framework with brain inner state for natural image identification

Authors: Hao Wu, Ziyu Zhu, Jiayi Wang, Nanning Zheng, Badong Chen

Abstract: Neural encoding and decoding, which aim to characterize the relationship between stimuli and brain activities, have emerged as an important area in cognitive neuroscience. Traditional encoding models, which focus on feature extraction and mapping, consider the brain as an input-output mapper without inner states. In this work, inspired by the fact that the human brain acts like a state machine, we proposed a novel encoding framework that combines information from both the external world and the inner state to predict brain activity. The framework comprises two parts: a forward encoding model that deals with visual stimuli and an inner state model that captures influence from intrinsic connections in the brain. The forward model can be any traditional encoding model, making the framework flexible. The inner state model is a linear model that utilizes information in the prediction residuals of the forward model. The proposed encoding framework can achieve much better performance on natural image identification from fMRI response than forward-only models. The identification accuracy decreases slightly as the dataset size increases, but remains relatively stable across different identification methods. The results confirm that the new encoding framework is effective and robust when used for brain decoding.

Submitted 22 August, 2019; originally announced August 2019.

arXiv:1711.00045 [pdf]

Retention Time of Peptides in Liquid Chromatography Is Well Estimated upon Deep Transfer Learning

Authors: Chunwei Ma, Zhiyong Zhu, Jun Ye, Jiarui Yang, Jianguo Pei, Shaohang Xu, Chang Yu, Fan Mo, Bo Wen, Siqi Liu

Abstract: A fully automatic predictor for peptide retention time (RT) in liquid chromatography (LC), termed DeepRT, was developed using a deep learning approach, an ensemble of Residual Network (ResNet) and Long Short-Term Memory (LSTM). In contrast to the traditional predictor based on the hand-crafted features for peptides, DeepRT learns features from raw amino acid sequences and makes relatively accurate prediction of peptide RTs with 0.987 R² for unmodified peptides. Furthermore, by virtue of transfer learning, DeepRT enables utilization of the peptides datasets generated from different LC conditions and of different modification status, resulting in the RT prediction of 0.992 R² for unmodified peptides and 0.978 R² for post-translationally modified peptides. Even though chromatographic behaviors of peptides are quite complicated, the study here demonstrated that peptide RT prediction could be largely improved by deep transfer learning. The DeepRT software is freely available at https://github.com/horsepurve/DeepRT, under Apache2 open source License.

Submitted 31 October, 2017; originally announced November 2017.

Comments: 13-page research article

arXiv:1705.05368 [pdf]

DeepRT: deep learning for peptide retention time prediction in proteomics

Authors: Chunwei Ma, Zhiyong Zhu, Jun Ye, Jiarui Yang, Jianguo Pei, Shaohang Xu, Ruo Zhou, Chang Yu, Fan Mo, Bo Wen, Siqi Liu

Abstract: Accurate predictions of peptide retention times (RT) in liquid chromatography have many applications in mass spectrometry-based proteomics. Herein, we present DeepRT, a deep learning based software for peptide retention time prediction. DeepRT automatically learns features directly from the peptide sequences using the deep convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) model, which eliminates the need to use hand-crafted features or rules. After the feature learning, principal component analysis (PCA) was used for dimensionality reduction, then three conventional machine learning methods were utilized to perform modeling. Two published datasets were used to evaluate the performance of DeepRT and we demonstrate that DeepRT greatly outperforms previous state-of-the-art approaches ELUDE and GPTime.

Submitted 15 May, 2017; originally announced May 2017.

arXiv:1302.7276 [pdf]

doi 10.1016/j.ygeno.2015.04.002

Role of genetic polymorphisms in transgenerational inheritance in budding yeast

Authors: Zuobin Zhu, Qing Lu, Dejian Yuan, Yanke Li, Xian Man, Yueran Zhu, Shi Huang

Abstract: Transgenerational inheritance of a trait is presumably affected by both genetic and environmental factors but remains poorly understood. We studied the effect of genetic polymorphisms on transgenerational inheritance of yeast segregants that were derived from a cross between a laboratory strain and a wild strain of Saccharomyces cerevisiae. For each SNP analyzed, the parental allele present in less than half of the segregants panel was called the minor allele (MA). We found a nonrandom distribution of MAs in the segregants, indicating natural selection. We compared segregants with high MA content (MAC) relative to those with less and found a more dramatic shortening of the lag phase length for the high MAC group in response to 14 days of ethanol training. Also, the short lag phase as acquired and epigenetically memorized by ethanol training was more dramatically lost after 7 days of recovery in ethanol free medium for the high MAC group. Sodium chloride treatment produced similar observations. Using public datasets, we found MAC linkage to mRNA expression of hundreds of genes. Finally, we found preferential effect of MAC on traits with high number of known additive quantitative trait loci (QTLs). These results provide evidence for the slightly deleterious nature of most MAs and a lower capacity to maintain inheritance of traits in individuals or cells with greater MAC, which have implications for disease prevention and treatment and the "missing heritability" problem in complex traits and diseases.

Submitted 12 July, 2013; v1 submitted 27 February, 2013; originally announced February 2013.

Comments: 22 pages, 3 figures, 1 table, 7 supplementary tables

Journal ref: Genomics, 106: 23-29 (2015)
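
The minor allele (MA) definition in the abstract above, the parental allele present in less than half of the segregant panel, leads to a simple per-segregant minor allele content (MAC) calculation. This is a sketch under stated assumptions, not the authors' scoring code: genotypes are encoded as 0 or 1 for the two parental alleles, MAC is taken as the fraction of informative SNPs at which a segregant carries the MA, and SNPs split exactly in half are skipped. The encoding, the normalization, the tie handling, and the function names are all hypothetical.

```python
def minor_alleles(genotypes):
    """For each SNP, return the allele (0 or 1) carried by under half the panel.

    genotypes: list of rows, one per segregant, each a list of 0/1 alleles.
    SNPs split exactly in half have no minor allele and map to None.
    """
    n = len(genotypes)
    minors = []
    for j in range(len(genotypes[0])):
        ones = sum(row[j] for row in genotypes)
        if 2 * ones < n:
            minors.append(1)
        elif 2 * ones > n:
            minors.append(0)
        else:
            minors.append(None)
    return minors


def minor_allele_content(genotypes):
    """Fraction of informative SNPs at which each segregant carries the MA."""
    minors = minor_alleles(genotypes)
    informative = [j for j, m in enumerate(minors) if m is not None]
    if not informative:
        return [0.0 for _ in genotypes]
    return [
        sum(1 for j in informative if row[j] == minors[j]) / len(informative)
        for row in genotypes
    ]


if __name__ == "__main__":
    panel = [[1, 0, 1], [0, 0, 1], [0, 1, 1], [0, 0, 0]]
    print(minor_alleles(panel))
    print(minor_allele_content(panel))
```

Segregants can then be ranked by MAC to form high and low groups for comparison, as the abstract describes.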

arXiv:1209.2911 [pdf]

doi 10.1007/s11427-014-4704-4

Methods for scoring the collective effect of SNPs: Minor alleles of common SNPs quantitatively affect traits/diseases and are under both positive and negative selection

Authors: Dejian Yuan, Zuobin Zhu, Xiaohua Tan, Jie Liang, Ceng Zeng, Jiegen Zhang, Jun Chen, Long Ma, Ayca Dogan, Gudrun Brockmann, Oliver Goldmann, Eva Medina, Amanda D. Rice, Richard W. Moyer, Xian Man, Ke Yi, Yanke Li, Qing Lu, Yimin Huang, Dapeng Wang, Jun Yu, Hui Guo, Kun Xia, Shi Huang

Abstract: Most common SNPs are popularly assumed to be neutral. We here developed novel methods to examine in animal models and humans whether an extreme amount of minor alleles (MAs) carried by an individual may represent extreme trait values and common diseases. We analyzed panels of genetic reference populations and identified the MAs in each panel and the MA content (MAC) that each strain carried. We also analyzed 21 published GWAS datasets of human diseases and identified the MAC of each case or control. MAC was nearly linearly linked to quantitative variations in numerous traits in model organisms, including life span, tumor susceptibility, learning and memory, sensitivity to alcohol and anti-psychotic drugs, and two correlated traits, poor reproductive fitness and strong immunity. Similarly, in Europeans or European Americans, enrichment of MAs of fast but not slow evolutionary rate was linked to autoimmune and numerous other diseases, including type 2 diabetes, Parkinson's disease, psychiatric disorders, alcohol and cocaine addictions, cancer, and shorter life span. Therefore, both high and low MAC correlated with extreme values in many traits, indicating stabilizing selection on most MAs. The methods here are broadly applicable and may help solve the missing heritability problem in complex traits and diseases.

Submitted 15 July, 2013; v1 submitted 12 September, 2012; originally announced September 2012.

Journal ref: Sci China Life Sci. 57:876-888. (2014)

arXiv:cond-mat/9606101 [pdf, ps, other]

Relationship Between Structural Fractal and Possible Dynamic Scaling Properties in Protein Folding

Authors: Liang-Jian Zou, X. G. Gong, Zheng-Gang Zhu

Abstract: In this letter, the possible dynamic scaling properties of protein molecules in folding are investigated theoretically by assuming that the protein molecules are percolated networks. It is shown that the fractal character and the fractal dimensionality may exist only for short sequences in large protein molecules and for small protein molecules with homogeneous structure; the fractal dimensionalities are obtained for different structures. We then show that dynamic scaling properties might exist in protein folding; the critical exponents in the folding for some small global proteins with homogeneous structure are obtained. The dynamic critical exponents of the global proteins in folding are relevant to the fractal dimensionality of their structure, which implies a close relationship between the dynamic process in protein folding and its structure kinematics.

Submitted 14 June, 1996; originally announced June 1996.

Comments: Latex, 2 Figures, 13 Pages

Showing 1–16 of 16 results for author: Zhu, Z