Showing 1–50 of 64 results for author: Xu, Y

Searching in archive q-bio.
  1. arXiv:2407.19852  [pdf]

    quant-ph cs.LG q-bio.BM

    Quantum Long Short-Term Memory for Drug Discovery

    Authors: Liang Zhang, Yin Xu, Mohan Wu, Liang Wang, Hua Xu

    Abstract: Quantum computing combined with machine learning (ML) is an extremely promising research area, with numerous studies demonstrating that quantum machine learning (QML) is expected to solve scientific problems more effectively than classical ML. In this work, we successfully apply QML to drug discovery, showing that QML can significantly improve model performance and achieve faster convergence compa…

    Submitted 29 July, 2024; originally announced July 2024.

  2. arXiv:2407.09274  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Unifying Sequences, Structures, and Descriptions for Any-to-Any Protein Generation with the Large Multimodal Model HelixProtX

    Authors: Zhiyuan Chen, Tianhao Chen, Chenggang Xie, Yang Xue, Xiaonan Zhang, Jingbo Zhou, Xiaomin Fang

    Abstract: Proteins are fundamental components of biological systems and can be represented through various modalities, including sequences, structures, and textual descriptions. Despite the advances in deep learning and scientific large language models (LLMs) for protein research, current methodologies predominantly focus on limited specialized tasks -- often predicting one protein modality from another. Th…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.01649  [pdf, other]

    q-bio.QM cs.LG

    FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

    Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

    Abstract: Despite the striking success of general protein folding models such as AlphaFold2 (AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl…

    Submitted 1 July, 2024; originally announced July 2024.

  4. arXiv:2406.19611  [pdf, other]

    q-bio.QM cs.AI

    Multimodal Data Integration for Precision Oncology: Challenges and Future Directions

    Authors: Huajun Zhou, Fengtao Zhou, Chenyu Zhao, Yingxue Xu, Luyang Luo, Hao Chen

    Abstract: The essence of precision oncology lies in its commitment to tailor targeted treatments and care measures to each patient based on the individual characteristics of the tumor. The inherent heterogeneity of tumors necessitates gathering information from diverse data sources to provide valuable insights from various perspectives, fostering a holistic comprehension of the tumor. Over the past decade,…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 15 pages, 4 figures

  5. arXiv:2406.05743  [pdf, other]

    cs.NE q-bio.BM

    Peptide Vaccine Design by Evolutionary Multi-Objective Optimization

    Authors: Dan-Xuan Liu, Yi-Heng Xu, Chao Qian

    Abstract: Peptide vaccines are growing in significance for fighting diverse diseases. Machine learning has improved the identification of peptides that can trigger immune responses, and the main challenge of peptide vaccine design now lies in selecting an effective subset of peptides due to the allelic diversity among individuals. Previous works mainly formulated this task as a constrained optimization prob…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: This paper has appeared at IJCAI'24

  6. arXiv:2404.10260  [pdf, other]

    q-bio.BM cs.AI

    HelixFold-Multimer: Elevating Protein Complex Structure Prediction to New Heights

    Authors: Xiaomin Fang, Jie Gao, Jing Hu, Lihang Liu, Yang Xue, Xiaonan Zhang, Kunrui Zhu

    Abstract: While monomer protein structure prediction tools boast impressive accuracy, the prediction of protein complex structures remains a daunting challenge in the field. This challenge is particularly pronounced in scenarios involving complexes with protein chains from different species, such as antigen-antibody interactions, where accuracy often falls short. Limited by the accuracy of complex predictio…

    Submitted 17 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  7. arXiv:2404.00044  [pdf, other]

    physics.chem-ph cs.AI cs.LG q-bio.QM

    UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment

    Authors: Kaipeng Zeng, Bo yang, Xin Zhao, Yu Zhang, Fan Nie, Xiaokang Yang, Yaohui Jin, Yanyan Xu

    Abstract: Motivation: Retrosynthesis planning poses a formidable challenge in the organic chemical industry. Single-step retrosynthesis prediction, a crucial step in the planning process, has witnessed a surge in interest in recent years due to advancements in AI for science. Various deep learning-based methods have been proposed for this task in recent years, incorporating diverse levels of additional chem…

    Submitted 19 April, 2024; v1 submitted 24 March, 2024; originally announced April 2024.

  8. arXiv:2402.19190  [pdf, ps, other]

    q-bio.PE

    Prediction of vaccination coverage level in the heterogeneous mixing population

    Authors: Fan Bai, Qianyu Chen, Yizhuo Xu

    Abstract: Population heterogeneity is a key factor in modeling the transmission of disease and has a huge impact on the outcome of the transmission. To investigate how individuals in a heterogeneously mixing population decide whether to be vaccinated, we propose a modeling framework that combines epidemic models with game-theoretical analysis. W…

    Submitted 29 February, 2024; originally announced February 2024.

  9. arXiv:2402.16901  [pdf, other]

    q-bio.GN cs.AI cs.LG

    FGBERT: Function-Driven Pre-trained Gene Language Model for Metagenomics

    Authors: ChenRui Duan, Zelin Zang, Yongjie Xu, Hang He, Zihan Liu, Zijia Song, Ju-Sheng Zheng, Stan Z. Li

    Abstract: Metagenomic data, comprising mixed multi-species genomes, are prevalent in diverse environments like oceans and soils, significantly impacting human health and ecological functions. However, current research relies on K-mer representations, limiting the capture of structurally relevant gene contexts. To address these limitations and further our understanding of complex relationships between metage…

    Submitted 24 February, 2024; originally announced February 2024.

  10. arXiv:2401.01059  [pdf, other]

    q-bio.QM

    Accelerating Discovery of Novel and Bioactive Ligands With Pharmacophore-Informed Generative Models

    Authors: Weixin Xie, Jianhang Zhang, Qin Xie, Chaojun Gong, Youjun Xu, Luhua Lai, Jianfeng Pei

    Abstract: Deep generative models have gained significant advancements to accelerate drug discovery by generating bioactive chemicals against desired targets. Nevertheless, most generated compounds that have been validated for potent bioactivity often exhibit structural novelty levels that fall short of satisfaction, thereby providing limited inspiration to human medicinal chemists. The challenge faced by ge…

    Submitted 2 January, 2024; originally announced January 2024.

  11. arXiv:2309.16994  [pdf]

    q-bio.GN

    A rigorous benchmarking of methods for SARS-CoV-2 lineage abundance estimation in wastewater

    Authors: Viorel Munteanu, Victor Gordeev, Michael Saldana, Eva Aßmann, Justin Maine Su, Nicolae Drabcinski, Oksana Zlenko, Maryna Kit, Felicia Iordachi, Khooshbu Kantibhai Patel, Abdullah Al Nahid, Likhitha Chittampalli, Yidian Xu, Pavel Skums, Shelesh Agrawal, Martin Hölzer, Adam Smith, Alex Zelikovsky, Serghei Mangul

    Abstract: In light of the continuous transmission and evolution of SARS-CoV-2 coupled with a significant decline in clinical testing, there is a pressing need for scalable, cost-effective, long-term, passive surveillance tools to effectively monitor viral variants circulating in the population. Wastewater genomic surveillance of SARS-CoV-2 has arrived as an alternative to clinical genomic surveillance, allo…

    Submitted 21 January, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: For correspondence: serghei.mangul@gmail.com

  12. arXiv:2309.10063  [pdf, other]

    q-bio.NC cs.AI

    Survey of Consciousness Theory from Computational Perspective

    Authors: Zihan Ding, Xiaoxi Wei, Yidan Xu

    Abstract: Human consciousness has been a long-lasting mystery for centuries, while machine intelligence and consciousness is an arduous pursuit. Researchers have developed diverse theories for interpreting the consciousness phenomenon in human brains from different perspectives and levels. This paper surveys several main branches of consciousness theories originating from different subjects including inform…

    Submitted 18 September, 2023; originally announced September 2023.

  13. arXiv:2309.07165  [pdf]

    q-bio.PE

    Revive, Restore, Revitalize: An Eco-economic Methodology for Maasai Mara

    Authors: Yipeng Xu, He Sun, Junfeng Zhu

    Abstract: The Maasai Mara in Kenya, renowned for its biodiversity, is witnessing ecosystem degradation and species endangerment due to intensified human activities. Addressing this, we introduce a dynamic system harmonizing ecological and human priorities. Our agent-based model replicates the Maasai Mara savanna ecosystem, incorporating 71 animal species, 10 human classifications, and 2 natural resource typ…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: 25 pages, 16 figures

  14. MC-NN: An End-to-End Multi-Channel Neural Network Approach for Predicting Influenza A Virus Hosts and Antigenic Types

    Authors: Yanhua Xu, Dominik Wojtczak

    Abstract: Influenza poses a significant threat to public health, particularly among the elderly, young children, and people with underlying diseases. The manifestation of severe conditions, such as pneumonia, highlights the importance of preventing the spread of influenza. An accurate and cost-effective prediction of the host and antigenic subtypes of influenza A viruses is essential to addressing this is…

    Submitted 21 February, 2024; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted version submitted to the SN Computer Science; Published in the SN Computer Science 2023; V2: minor updates were made to the Results section; V3: minor updates regarding data description; V4: correct the time stamps mentioned in the legends of Figures 1 and 2

  15. arXiv:2304.00970  [pdf, other]

    q-bio.BM cs.LG q-bio.QM

    Development and Evaluation of Conformal Prediction Methods for QSAR

    Authors: Yuting Xu, Andy Liaw, Robert P. Sheridan, Vladimir Svetnik

    Abstract: The quantitative structure-activity relationship (QSAR) regression model is a commonly used technique for predicting biological activities of compounds using their molecular descriptors. Predictions from QSAR models can help, for example, to optimize molecular structure; prioritize compounds for further experimental testing; and estimate their toxicity. In addition to the accurate estimation of th…

    Submitted 3 April, 2023; originally announced April 2023.

  16. arXiv:2303.17706  [pdf, other]

    eess.IV q-bio.QM

    Label Propagation via Random Walk for Training Robust Thalamus Nuclei Parcellation Model from Noisy Annotations

    Authors: Anqi Feng, Yuan Xue, Yuli Wang, Chang Yan, Zhangxing Bian, Muhan Shao, Jiachen Zhuo, Rao P. Gullapalli, Aaron Carass, Jerry L. Prince

    Abstract: Data-driven thalamic nuclei parcellation depends on high-quality manual annotations. However, the small size and low contrast changes among thalamic nuclei yield annotations that are often incomplete, noisy, or ambiguously labelled. To train a robust thalamic nuclei parcellation model with noisy annotations, we propose a label propagation algorithm based on random walker to refine the annotations…

    Submitted 30 March, 2023; originally announced March 2023.

  17. arXiv:2303.09007  [pdf]

    cs.LG cs.AI q-bio.QM

    Machine Learning for Flow Cytometry Data Analysis

    Authors: Yanhua Xu

    Abstract: Flow cytometry is mainly used for detecting the characteristics of a number of biochemical substances based on the expression of specific markers in cells. It is particularly useful for detecting membrane surface receptors, antigens, ions, or during DNA/RNA expression. Not only can it be employed as a biomedical research tool for recognising distinctive types of cells in mixed populations, but it can…

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: MSc thesis

  18. arXiv:2302.12398  [pdf]

    q-bio.NC

    Characterize the non-Gaussian diffusion property of cerebrospinal fluid using Diffusion Kurtosis Imaging and explore its diagnostic efficacy for Alzheimer's disease

    Authors: Yingnan Xue, Min Wen, Qiong Ye

    Abstract: Differentiating Alzheimer's disease (AD) patients from healthy controls (HCs) remains a challenge. The changes of protein level in cerebrospinal fluid (CSF) of AD patients have been reported in the literature. Macromolecules will hinder the movement of water in CSF and lead to non-Gaussian diffusion. Diffusion kurtosis imaging (DKI) is a commonly used technique for quantifying non-Gaussian diffusi…

    Submitted 23 February, 2023; originally announced February 2023.

    Comments: 3 tables, 6 figures

  19. arXiv:2211.16742  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Protein Language Models and Structure Prediction: Connection and Progression

    Authors: Bozhen Hu, Jun Xia, Jiangbin Zheng, Cheng Tan, Yufei Huang, Yongjie Xu, Stan Z. Li

    Abstract: The prediction of protein structures from sequences is an important task for function prediction, drug design, and related biological processes understanding. Recent advances have proved the power of language models (LMs) in processing the protein sequence databases, which inherit the advantages of attention networks and capture useful information in learning representations for proteins. The past…

    Submitted 29 November, 2022; originally announced November 2022.

  20. arXiv:2211.00551  [pdf, other]

    q-bio.TO cs.CE eess.IV physics.med-ph

    Data-driven generation of 4D velocity profiles in the aneurysmal ascending aorta

    Authors: Simone Saitta, Ludovica Maga, Chloe Armour, Emiliano Votta, Declan P. O'Regan, M. Yousuf Salmasi, Thanos Athanasiou, Jonathan W. Weinsaft, Xiao Yun Xu, Selene Pirola, Alberto Redaelli

    Abstract: Numerical simulations of blood flow are a valuable tool to investigate the pathophysiology of ascending thoracic aortic aneurysms (ATAA). To accurately reproduce hemodynamics, computational fluid dynamics (CFD) models must employ realistic inflow boundary conditions (BCs). However, the limited availability of in vivo velocity measurements still makes researchers resort to idealized BCs. In this st…

    Submitted 1 November, 2022; originally announced November 2022.

    Comments: 21 pages, 5 figures, 2 tables To be submitted to "Computer methods and programs in biomedicine" Scripts: https://github.com/saitta-s/flow4D Synthetic velocity profiles: //doi.org/10.5281/zenodo.7251987

  21. arXiv:2210.01765  [pdf, other]

    cs.LG q-bio.BM stat.ML

    One Transformer Can Understand Both 2D & 3D Molecular Data

    Authors: Shengjie Luo, Tianlang Chen, Yixian Xu, Shuxin Zheng, Tie-Yan Liu, Liwei Wang, Di He

    Abstract: Unlike vision and language data which usually has a unique format, molecules can naturally be characterized using different chemical formulations. One can view a molecule as a 2D graph or define it as a collection of atoms located in a 3D space. For molecular representation learning, most previous works designed neural networks only for a particular data format, making the learned models likely to…

    Submitted 27 March, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 20 pages; ICLR 2023, Camera Ready Version; Code: https://github.com/lsj2408/Transformer-M

  22. arXiv:2209.07405  [pdf]

    q-bio.BM cs.LG

    Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based Reinforcement Learning Model

    Authors: Yaqin Li, Lingli Li, Yongjin Xu, Yi Yu

    Abstract: De novo molecular design has facilitated the exploration of large chemical space to accelerate drug discovery. Structure-based de novo method can overcome the data scarcity of active ligands by incorporating drug-target interaction into deep generative architectures. However, these strategies are bottlenecked by the small fraction of experimentally determined protein or complex structures. In addi…

    Submitted 14 August, 2022; originally announced September 2022.

  23. arXiv:2209.05240  [pdf, ps, other]

    math.DS math-ph q-bio.PE

    Dynamics of COVID-19 models with asymptomatic infections and quarantine measures

    Authors: Songbai Guo, Yuling Xue, Xiliang Li, Zuohuan Zheng

    Abstract: Considering the propagation characteristics of COVID-19 in different regions, the dynamics analysis and numerical demonstration of long-term and short-term models of COVID-19 are carried out, respectively. The long-term model is devoted to investigating the global stability of a COVID-19 model with asymptomatic infections and quarantine measures. By using the limit system of the model and Lyapunov fun…

    Submitted 6 November, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    MSC Class: 34D23; 37N25; 92D30

  24. arXiv:2207.05477  [pdf, other]

    cs.DC cs.LG q-bio.BM

    HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

    Authors: Guoxia Wang, Xiaomin Fang, Zhihua Wu, Yiqun Liu, Yang Xue, Yingfei Xiang, Dianhai Yu, Fan Wang, Yanjun Ma

    Abstract: Accurate protein structure prediction can significantly accelerate the development of life science. The accuracy of AlphaFold2, a frontier end-to-end structure prediction system, is already close to that of the experimental determination techniques. Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and…

    Submitted 13 July, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

  25. arXiv:2206.03823  [pdf, other]

    q-bio.QM cs.LG

    Multi-channel neural networks for predicting influenza A virus hosts and antigenic types

    Authors: Yanhua Xu, Dominik Wojtczak

    Abstract: Influenza occurs every season and occasionally causes pandemics. Despite its low mortality rate, influenza is a major public health concern, as it can be complicated by severe diseases like pneumonia. A fast, accurate and low-cost method to predict the origin host and subtype of influenza viruses could help reduce virus transmission and benefit resource-poor areas. In this work, we propose multi-c…

    Submitted 29 July, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

    Comments: Accepted for publication at IC3K (KDIR) 2022

  26. Accurate Virus Identification with Interpretable Raman Signatures by Machine Learning

    Authors: Jiarong Ye, Yin-Ting Yeh, Yuan Xue, Ziyang Wang, Na Zhang, He Liu, Kunyan Zhang, RyeAnne Ricker, Zhuohang Yu, Allison Roder, Nestor Perea Lopez, Lindsey Organtini, Wallace Greene, Susan Hafenstein, Huaguang Lu, Elodie Ghedin, Mauricio Terrones, Shengxi Huang, Sharon Xiaolei Huang

    Abstract: Rapid identification of newly emerging or circulating viruses is an important first step toward managing the public health response to potential outbreaks. A portable virus capture device coupled with label-free Raman Spectroscopy holds the promise of fast detection by rapidly obtaining the Raman signature of a virus followed by a machine learning approach applied to recognize the virus based on i…

    Submitted 5 June, 2022; originally announced June 2022.

    Comments: 23 pages, 8 figures

    Journal ref: Proceedings of the National Academy of Sciences of the United States of America (2022)

  27. arXiv:2205.15560  [pdf, other]

    q-bio.PE math.DS

    A novel analysis approach of uniform persistence for a COVID-19 model with quarantine and standard incidence rate

    Authors: Songbai Guo, Yuling Xue, Xiliang Li, Zuohuan Zheng

    Abstract: A coronavirus disease 2019 (COVID-19) model with quarantine and standard incidence rate is first developed, then a novel analysis approach for finding the ultimate lower bound of COVID-19 infectious individuals is proposed, which means that the COVID-19 pandemic is uniformly persistent if the control reproduction number $\mathcal{R}_{c}>1$. This approach can be applied to other related biomathemat…

    Submitted 31 October, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

    Comments: 13 pages, 1 figure

    MSC Class: 34D05; 37N25; 92D30

  28. arXiv:2205.11016  [pdf, other]

    cs.CV q-bio.QM

    MolMiner: You only look once for chemical structure recognition

    Authors: Youjun Xu, Jinchuan Xiao, Chia-Han Chou, Jianhang Zhang, Jintao Zhu, Qiwan Hu, Hemin Li, Ningsheng Han, Bingyu Liu, Shuaipeng Zhang, Jinyu Han, Zhen Zhang, Shuhao Zhang, Weilin Zhang, Luhua Lai, Jianfeng Pei

    Abstract: Molecular structures are always depicted in 2D printed form in scientific documents like journal papers and patents. However, these 2D depictions are not machine-readable. Due to a backlog of decades and an increasing amount of this printed literature, there is a high demand for the translation of printed depictions into machine-readable formats, which is known as Optical Chemical Structure Recog…

    Submitted 22 May, 2022; originally announced May 2022.

    Comments: 19 pages, 4 figures

  29. arXiv:2203.06714  [pdf, other]

    cs.LG cs.SI q-bio.MN

    A Survey on Deep Graph Generation: Methods and Applications

    Authors: Yanqiao Zhu, Yuanqi Du, Yinkai Wang, Yichen Xu, Jieyu Zhang, Qiang Liu, Shu Wu

    Abstract: Graphs are ubiquitous in encoding relational information of real-world objects in many domains. Graph generation, whose purpose is to generate new graphs from a distribution similar to the observed graphs, has received increasing attention thanks to the recent advances of deep learning models. In this paper, we conduct a comprehensive review on the existing literature of deep graph generation from…

    Submitted 6 December, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Comments: Accepted to the First Learning on Graphs Conference (LoG 2022)

  30. arXiv:2112.04814  [pdf, other]

    q-bio.BM cs.LG

    Multimodal Pre-Training Model for Sequence-based Prediction of Protein-Protein Interaction

    Authors: Yang Xue, Zijing Liu, Xiaomin Fang, Fan Wang

    Abstract: Protein-protein interactions (PPIs) are essential for many biological processes where two or more proteins physically bind together to achieve their functions. Modeling PPIs is useful for many biomedical applications, such as vaccine design, antibody therapeutics, and peptide drug discovery. Pre-training a protein model to learn effective representation is critical for PPIs. Most pre-training mod…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: MLCB 2021 Spotlight

  31. arXiv:2111.01351  [pdf, other]

    q-bio.NC cs.LG

    Major Depressive Disorder Recognition and Cognitive Analysis Based on Multi-layer Brain Functional Connectivity Networks

    Authors: Xiaofang Sun, Xiangwei Zheng, Yonghui Xu, Lizhen Cui, Bin Hu

    Abstract: With the increasing prevalence of major depressive disorder (MDD), many researchers have paid attention to its recognition and treatment. Existing MDD recognition algorithms always use a single time-frequency domain method, but such a method is too simple and is not conducive to simulating the complex link relationship between brain functions. To solve this problem, this paper prop…

    Submitted 1 November, 2021; originally announced November 2021.

    Journal ref: International Workshop on AI for Cognitive and Physical Frailty Workshop in Conjunction with IJCAI 2021 (AIF-IJCAI'21)

  32. arXiv:2110.10918  [pdf]

    q-bio.QM

    Deep Learning Model of Dock by Dock Process Significantly Accelerate the Process of Docking-based Virtual Screening

    Authors: Wei Ma, Qin Xie, Jianhang Zhang, Shiliang Li, Youjun Xu, Xiaobing Deng, Weilin Zhang

    Abstract: Docking-based virtual screening (VS process) selects ligands with potential pharmacological activities from millions of molecules using computational docking methods, which could greatly reduce the number of compounds for experimental screening, shorten the research period and save the research cost. However, a majority of compounds with low docking scores could waste most of the computational res…

    Submitted 25 October, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 25 pages, 7 figures

  33. arXiv:2109.03309  [pdf]

    q-bio.QM cs.LG

    CRNNTL: convolutional recurrent neural network and transfer learning for QSAR modelling

    Authors: Yaqin Li, Yongjin Xu, Yi Yu

    Abstract: In this study, we propose the convolutional recurrent neural network and transfer learning (CRNNTL) for QSAR modelling. The method was inspired by the applications of polyphonic sound detection and electrocardiogram classification. Our strategy takes advantage of both convolutional and recurrent neural networks for feature extraction, as well as the data augmentation method. Herein, CRNNTL is eva…

    Submitted 7 September, 2021; originally announced September 2021.

  34. arXiv:2105.01238  [pdf, other]

    cs.LG q-bio.QM

    Supervised multi-specialist topic model with applications on large-scale electronic health record data

    Authors: Ziyang Song, Xavier Sumba Toral, Yixin Xu, Aihua Liu, Liming Guo, Guido Powell, Aman Verma, David Buckeridge, Ariane Marelli, Yue Li

    Abstract: Motivation: Electronic health record (EHR) data provides a new venue to elucidate disease comorbidities and latent phenotypes for precision medicine. To fully exploit its potential, a realistic data generative process of the EHR data needs to be modelled. We present MixEHR-S to jointly infer specialist-disease topics from the EHR data. As the key contribution, we model the specialist assignments a…

    Submitted 3 May, 2021; originally announced May 2021.

  35. arXiv:2104.12320  [pdf, other]

    q-bio.GN

    Integration of Unpaired Single-cell Chromatin Accessibility and Gene Expression Data via Adversarial Learning

    Authors: Yang Xu, Andrew Jeremiah Strick

    Abstract: Deep learning has empowered analysis for single-cell sequencing data in many ways and has generated deep understanding about a range of complex cellular systems. As the booming single-cell sequencing technologies bring a surge of high-dimensional data that come from different sources and represent cellular systems with different features, there is an equivalent rise and challenge of integrating…

    Submitted 25 April, 2021; originally announced April 2021.

  36. arXiv:2101.01532  [pdf]

    stat.AP physics.bio-ph physics.soc-ph q-bio.PE

    Bayesian data assimilation for estimating epidemic evolution: a COVID-19 study

    Authors: Xian Yang, Shuo Wang, Yuting Xing, Ling Li, Richard Yi Da Xu, Karl J. Friston, Yike Guo

    Abstract: The evolution of epidemiological parameters, such as instantaneous reproduction number Rt, is important for understanding the transmission dynamics of infectious diseases. Current estimates of time-varying epidemiological parameters often face problems such as lagging observations, averaging inference, and improper quantification of uncertainties. To address these problems, we propose a Bayesian d…

    Submitted 24 October, 2021; v1 submitted 22 December, 2020; originally announced January 2021.

    Comments: Xian Yang, Shuo Wang and Yuting Xing contribute equally

  37. arXiv:2006.08058  [pdf]

    q-bio.GN

    EDGE COVID-19: A Web Platform to generate submission-ready genomes for SARS-CoV-2 sequencing efforts

    Authors: Chien-Chi Lo, Migun Shakya, Karen Davenport, Mark Flynn, Adán Myers y Gutiérrez, Bin Hu, Po-E Li, Elais Player Jackson, Yan Xu, Patrick S. G. Chain

    Abstract: Genomics has become an essential technology for surveilling emerging infectious disease outbreaks. A wide range of technologies and strategies for pathogen genome enrichment and sequencing are being used by laboratories worldwide, together with different, and sometimes ad hoc, analytical procedures for generating genome sequences. As a result, public repositories now contain non-standard entries o…

    Submitted 24 June, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

  38. arXiv:2006.04566  [pdf]

    q-bio.GN q-bio.QM

    A Public Website for the Automated Assessment and Validation of SARS-CoV-2 Diagnostic PCR Assays

    Authors: Po-E Li, Adán Myers y Gutiérrez, Karen Davenport, Mark Flynn, Bin Hu, Chien-Chi Lo, Elais Player Jackson, Migun Shakya, Yan Xu, Jason Gans, Patrick S. G. Chain

    Abstract: Summary: Polymerase chain reaction-based assays are the current gold standard for detecting and diagnosing SARS-CoV-2. However, as SARS-CoV-2 mutates, we need to constantly assess whether existing PCR-based assays will continue to detect all known viral strains. To enable the continuous monitoring of SARS-CoV-2 assays, we have developed a web-based assay validation algorithm that checks existing P…

    Submitted 8 June, 2020; originally announced June 2020.

    Comments: Application Note. Main: 2 pages, 1 figure. Supplementary: 6 pages, 8 figures, 1 table. Total: 8 pages, 9 figures, 1 table. Application url: https://covid19.edgebioinformatics.org/#/assayValidation Contact: Jason Gans (jgans@lanl.gov) and Patrick Chain (pchain@lanl.gov) Submitted to: Bioinformatics

  39. arXiv:2005.14597  [pdf, ps, other]

    q-bio.NC physics.bio-ph

    Noise induces continuous and noncontinuous transitions in neuronal interspike intervals range

    Authors: P R Protachevicz, M S Santos, E G Seifert, E C Gabrick, F S Borges, R R Borges, J Trobia, J D Szezech Jr, K C Iarosz, I L Caldas, C G Antonopoulos, Y Xu, R L Viana, A M Batista

    Abstract: Noise appears in the brain due to various sources, such as ionic channel fluctuations and synaptic events. They affect the activities of the brain and influence neuron action potentials. Stochastic differential equations have been used to model firing patterns of neurons subject to noise. In this work, we consider perturbing noise in the adaptive exponential integrate-and-fire (AEIF) neuron. The A…

    Submitted 29 May, 2020; originally announced May 2020.

  40. arXiv:2005.12993  [pdf, ps, other]

    q-bio.PE physics.soc-ph stat.ME

    Estimating the Number of Infected Cases in COVID-19 Pandemic

    Authors: Donghui Yan, Ying Xu, Pei Wang

    Abstract: The COVID-19 pandemic has caused major disturbance to human life. An important reason behind the widespread social anxiety is the huge uncertainty about the pandemic. A fundamental uncertainty is how many or what percentage of people have been infected. There are published and frequently updated data on various statistics of the pandemic, at local, country or global level. However, due to various…

    Submitted 3 March, 2021; v1 submitted 24 May, 2020; originally announced May 2020.

    Comments: 20 pages, 10 figures

  41. arXiv:2004.04874  [pdf]

    q-bio.GN q-bio.BM

    Implications of the virus-encoded miRNA and host miRNA in the pathogenicity of SARS-CoV-2

    Authors: Zhi Liu, Jianwei Wang, Yuyu Xu, Mengchen Guo, Kai Mi, Rui Xu, Yang Pei, Qiangkun Zhang, Xiaoting Luan, Zhibin Hu, Xingyin Liu#

    Abstract: The outbreak of COVID-19 caused by SARS-CoV-2 has rapidly spread worldwide and has caused over 1,400,000 infections and 80,000 deaths. There are currently no drugs or vaccines with proven efficacy for its prevention and little knowledge was known about the pathogenicity mechanism of SARS-CoV-2 infection. Previous studies showed both virus and host-derived MicroRNAs (miRNAs) played crucial roles in…

    Submitted 9 April, 2020; originally announced April 2020.

    Comments: 24 pages,7 figures and 2 supplementary figures

  42. arXiv:2003.05580  [pdf]

    q-bio.PE cs.CE

    COVID-19 Evolves in Human Hosts

    Authors: Yanni Li, Bing Liu, Zhi Wang, Jiangtao Cui, Kaicheng Yao, Pengfan Lv, Yulong Shen, Yueshen Xu, Yuanfang Guan, Xiaoke Ma

    Abstract: Today, we are all threatened by an unprecedented pandemic: COVID-19. How different is it from other coronaviruses? Will it be attenuated or become more virulent? Which animals may be its original host? In this study, we collected and analyzed nearly thirty thousand publicly available complete genome sequences for COVID-19 virus from 79 different countries, the previously known flu-causing coronavi…

    Submitted 15 August, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  43. arXiv:2001.00520  [pdf]

    eess.IV physics.optics q-bio.QM

    3D Deep Learning Enables Fast Imaging of Spines through Scattering Media by Temporal Focusing Microscopy

    Authors: Zhun Wei, Josiah R. Boivin, Yi Xue, Xudong Chen, Peter T. C. So, Elly Nedivi, Dushan N. Wadduwage

    Abstract: Today the gold standard for in vivo imaging through scattering tissue is the point-scanning two-photon microscope (PSTPM). Especially in neuroscience, PSTPM is widely used for deep-tissue imaging in the brain. However, due to sequential scanning, PSTPM is slow. Temporal focusing microscopy (TFM), on the other hand, focuses femtosecond pulsed laser light temporally, while keeping wide-field illumin…

    Submitted 24 December, 2019; originally announced January 2020.

  44. arXiv:1906.02308  [pdf]

    q-bio.QM

    Automatic Retrosynthetic Pathway Planning Using Template-free Models

    Authors: Kangjie Lin, Youjun Xu, Jianfeng Pei, Luhua Lai

    Abstract: We present an attention-based Transformer model for automatic retrosynthesis route planning. Our approach starts from reactants prediction of single-step organic reactions for given products, followed by Monte Carlo tree search-based automatic retrosynthetic pathway prediction. Trained on two datasets from the United States patent literature, our models achieved a top-1 prediction accuracy of over…

    Submitted 21 May, 2019; originally announced June 2019.

  45. arXiv:1901.06794  [pdf]

    q-bio.GN cs.CE q-bio.QM

    Dual Graph-Laplacian PCA: A Closed-Form Solution for Bi-clustering to Find "Checkerboard" Structures on Gene Expression Data

    Authors: Jin-Xing Liu, Chun-Mei Feng, Xiang-Zhen Kong, Yong Xu

    Abstract: In the context of cancer, internal "checkerboard" structures are normally found in the matrices of gene expression data, which correspond to genes that are significantly up- or down-regulated in patients with specific types of tumors. In this paper, we propose a novel method, called dual graph-regularization principal component analysis (DGPCA). The main innovation of this method is that it simult…

    Submitted 21 January, 2019; originally announced January 2019.

    Comments: This manuscript was submitted in IEEE Transaction on Knowledge and Data Engineering on 12/01/2017. 9 pages, 3 figures

  46. arXiv:1806.01467  [pdf]

    q-bio.BM

    Directed Non-Targeted Mass Spectrometry and Chemical Networking for Discovery of Eicosanoids

    Authors: Jeramie D. Watrous, Teemu Niiranen, Kim A. Lagerborg, Mir Henglin, Yong-Jian Xu, Sonia Sharma, Ramachandran S. Vasan, Martin G. Larson, Aaron Armando, Oswald Quehenberger, Edward A. Dennis, Susan Cheng, Mohit Jain

    Abstract: Eicosanoids and related species are critical, small bioactive mediators of human physiology and inflammation. While ~1100 distinct eicosanoids have been predicted to exist, to date, less than 150 of these molecules have been measured in humans, limiting our understanding of eicosanoids and their role in human biology. Using a directed non-targeted mass spectrometry approach in conjunction with com…

    Submitted 4 June, 2018; originally announced June 2018.

  47. Deep Reinforcement Learning of Cell Movement in the Early Stage of C. elegans Embryogenesis

    Authors: Zi Wang, Dali Wang, Chengcheng Li, Yichi Xu, Husheng Li, Zhirong Bao

    Abstract: Cell movement in the early phase of C. elegans development is regulated by a highly complex process in which a set of rules and connections are formulated at distinct scales. Previous efforts have shown that agent-based, multi-scale modeling systems can integrate physical and biological rules and provide new avenues to study developmental systems. However, the application of these systems to model…

    Submitted 2 March, 2018; v1 submitted 14 January, 2018; originally announced January 2018.

    Comments: We revised the manuscript to make it clearer to follow. Please notice that the Abstract shown in this page is slightly different than that in the manuscript due to the limitation of 1920 characters in arxiv.org

    Report number: bty323

    Journal ref: Bioinformatics, 2018

  48. arXiv:1711.00629  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Sleep Stage Classification Based on Multi-level Feature Learning and Recurrent Neural Networks via Wearable Device

    Authors: Xin Zhang, Weixuan Kou, Eric I-Chao Chang, He Gao, Yubo Fan, Yan Xu

    Abstract: This paper proposes a practical approach for automatic sleep stage classification based on a multi-level feature learning framework and Recurrent Neural Network (RNN) classifier using heart rate and wrist actigraphy derived from a wearable device. The feature learning framework is designed to extract low- and mid-level features. Low-level features capture temporal and frequency domain properties a…

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: 11 pages, 10 figures

  49. arXiv:1705.03998  [pdf, other]

    cs.LG q-bio.QM

    Mining Functional Modules by Multiview-NMF of Phenome-Genome Association

    Authors: YaoGong Zhang, YingJie Xu, Xin Fan, YuXiang Hong, Jiahui Liu, ZhiCheng He, YaLou Huang, MaoQiang Xie

    Abstract: Background: Mining gene modules from genomic data is an important step to detect gene members of pathways or other relations such as protein-protein interactions. In this work, we explore the plausibility of detecting gene modules by factorizing gene-phenotype associations from a phenotype ontology rather than the conventionally used gene expression data. In particular, the hierarchical structure…

    Submitted 10 May, 2017; originally announced May 2017.

  50. arXiv:1705.03094  [pdf]

    q-bio.GN q-bio.QM

    DeepMetabolism: A Deep Learning System to Predict Phenotype from Genome Sequencing

    Authors: Weihua Guo, You Xu, Xueyang Feng

    Abstract: Life science is entering a new era of petabyte-level sequencing data. Converting such big data to biological insights represents a huge challenge for computational analysis. To this end, we developed DeepMetabolism, a biology-guided deep learning system to predict cell phenotypes from transcriptomics data. By integrating unsupervised pre-training with supervised training, DeepMetabolism is able to…

    Submitted 8 May, 2017; originally announced May 2017.