
Showing 1–50 of 238 results for author: Li, Y

Searching in archive q-bio.
  1. arXiv:2408.16975  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Technical Report of HelixFold3 for Biomolecular Structure Prediction

    Authors: Lihang Liu, Shanzhuo Zhang, Yang Xue, Xianbin Ye, Kunrui Zhu, Yuxin Li, Yang Liu, Wenlai Zhao, Hongkun Yu, Zhihua Wu, Xiaonan Zhang, Xiaomin Fang

    Abstract: The AlphaFold series has transformed protein structure prediction with remarkable accuracy, often matching experimental methods. AlphaFold2, AlphaFold-Multimer, and the latest AlphaFold3 represent significant strides in predicting single protein chains, protein complexes, and biomolecular structures. While AlphaFold2 and AlphaFold-Multimer are open-sourced, facilitating rapid and reliable predicti…

    Submitted 8 September, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2407.16715  [pdf]

    q-bio.QM cs.AI cs.LG

    Research on Adverse Drug Reaction Prediction Model Combining Knowledge Graph Embedding and Deep Learning

    Authors: Yufeng Li, Wenchao Zhao, Bo Dang, Xu Yan, Weimin Wang, Min Gao, Mingxuan Xiao

    Abstract: In clinical treatment, identifying potential adverse reactions of drugs can help assist doctors in making medication decisions. In response to the problems in previous studies that features are high-dimensional and sparse, independent prediction models need to be constructed for each adverse reaction of drugs, and the prediction accuracy is low, this paper develops an adverse drug reaction predict…

    Submitted 27 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 12 pages, 4 figures, 9 tables

  3. arXiv:2407.16684  [pdf, other]

    eess.IV cs.CV q-bio.NC

    AutoRG-Brain: Grounded Report Generation for Brain MRI

    Authors: Jiayu Lei, Xiaoman Zhang, Chaoyi Wu, Lisong Dai, Ya Zhang, Yanyong Zhang, Yanfeng Wang, Weidi Xie, Yuehua Li

    Abstract: Radiologists are tasked with interpreting a large number of images in a daily base, with the responsibility of generating corresponding reports. This demanding workload elevates the risk of human error, potentially leading to treatment delays, increased healthcare costs, revenue loss, and operational inefficiencies. To address these challenges, we initiate a series of work on grounded Automatic Re…

    Submitted 29 July, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  4. arXiv:2407.15301  [pdf, other]

    stat.ML cs.LG math.ST q-bio.QM

    U-learning for Prediction Inference via Combinatory Multi-Subsampling: With Applications to LASSO and Neural Networks

    Authors: Zhe Fei, Yi Li

    Abstract: Epigenetic aging clocks play a pivotal role in estimating an individual's biological age through the examination of DNA methylation patterns at numerous CpG (Cytosine-phosphate-Guanine) sites within their genome. However, making valid inferences on predicted epigenetic ages, or more broadly, on predictions derived from high-dimensional inputs, presents challenges. We introduce a novel U-learning a…

    Submitted 21 July, 2024; originally announced July 2024.

  5. arXiv:2407.14020  [pdf, other]

    q-bio.NC cs.LG

    NeuroBind: Towards Unified Multimodal Representations for Neural Signals

    Authors: Fengyu Yang, Chao Feng, Daniel Wang, Tianye Wang, Ziyao Zeng, Zhiyang Xu, Hyoungseob Park, Pengliang Ji, Hanbin Zhao, Yuanning Li, Alex Wong

    Abstract: Understanding neural activity and information representation is crucial for advancing knowledge of brain function and cognition. Neural activity, measured through techniques like electrophysiology and neuroimaging, reflects various aspects of information processing. Recent advances in deep neural networks offer new approaches to analyzing these signals using pre-trained models. However, challenges…

    Submitted 19 July, 2024; originally announced July 2024.

  6. arXiv:2407.13118  [pdf, other]

    q-bio.NC stat.CO

    Evaluating the evolution and inter-individual variability of infant functional module development from 0 to 5 years old

    Authors: Lingbin Bian, Nizhuan Wang, Yuanning Li, Adeel Razi, Qian Wang, Han Zhang, Dinggang Shen, the UNC/UMN Baby Connectome Project Consortium

    Abstract: The segregation and integration of infant brain networks undergo tremendous changes due to the rapid development of brain function and organization. Traditional methods for estimating brain modularity usually rely on group-averaged functional connectivity (FC), often overlooking individual variability. To address this, we introduce a novel approach utilizing Bayesian modeling to analyze the dynami…

    Submitted 17 July, 2024; originally announced July 2024.

  7. arXiv:2407.12051  [pdf, other]

    q-bio.GN cs.AI cs.LG

    Dy-mer: An Explainable DNA Sequence Representation Scheme using Sparse Recovery

    Authors: Zhiyuan Peng, Yuanbo Tang, Yang Li

    Abstract: DNA sequences encode vital genetic and biological information, yet these unfixed-length sequences cannot serve as the input of common data mining algorithms. Hence, various representation schemes have been developed to transform DNA sequences into fixed-length numerical representations. However, these schemes face difficulties in learning high-quality representations due to the complexity and spar…

    Submitted 6 July, 2024; originally announced July 2024.

  8. arXiv:2407.09922  [pdf]

    q-bio.NC

    Transcranial low-level laser stimulation in near infrared-II region for brain safety and protection

    Authors: Zhilin Li, Yongheng Zhao, Yiqing Hu, Yang Li, Keyao Zhang, Zhibing Gao, Lirou Tan, Hanli Liu, Xiaoli Li, Aihua Cao, Zaixu Cui, Chenguang Zhao

    Abstract: Background: The use of near-infrared lasers for transcranial photobiomodulation (tPBM) offers a non-invasive method for influencing brain activity and is beneficial for various neurological conditions. Objective: To investigate the safety and neuroprotective properties of tPBM using near-infrared (NIR)-II laser stimulation. Methods: We conducted thirteen experiments involving multidimensional and…

    Submitted 13 July, 2024; originally announced July 2024.

  9. arXiv:2407.09811  [pdf, other]

    cs.AI cs.HC q-bio.GN

    CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis

    Authors: Yihang Xiao, Jinyi Liu, Yan Zheng, Xiaohan Xie, Jianye Hao, Mingzhi Li, Ruitao Wang, Fei Ni, Yuxiao Li, Jintian Luo, Shaoqing Jiao, Jiajie Peng

    Abstract: Single-cell RNA sequencing (scRNA-seq) data analysis is crucial for biological research, as it enables the precise characterization of cellular heterogeneity. However, manual manipulation of various tools to achieve desired outcomes can be labor-intensive for researchers. To address this, we introduce CellAgent (http://cell.agent4science.cn/), an LLM-driven multi-agent framework, specifically desi…

    Submitted 13 July, 2024; originally announced July 2024.

  10. High-Performance Sorting-Based k-mer Counting in Distributed Memory with Flexible Hybrid Parallelism

    Authors: Yifan Li, Giulia Guidi

    Abstract: In generating large quantities of DNA data, high-throughput sequencing technologies require advanced bioinformatics infrastructures for efficient data analysis. k-mer counting, the process of quantifying the frequency of fixed-length k DNA subsequences, is a fundamental step in various bioinformatics pipelines, including genome assembly and protein prediction. Due to the growing volume of data, th…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 10 pages

    Journal ref: In The 53rd International Conference on Parallel Processing (ICPP 24), August 12-15, 2024, Gotland, Sweden

  11. arXiv:2407.01621  [pdf, other]

    cs.LG q-bio.QM stat.ME stat.ML

    Deciphering interventional dynamical causality from non-intervention systems

    Authors: Jifan Shi, Yang Li, Juan Zhao, Siyang Leng, Kazuyuki Aihara, Luonan Chen, Wei Lin

    Abstract: Detecting and quantifying causality is a focal topic in the fields of science, engineering, and interdisciplinary studies. However, causal studies on non-intervention systems attract much attention but remain extremely challenging. To address this challenge, we propose a framework named Interventional Dynamical Causality (IntDC) for such non-intervention systems, along with its computational crite…

    Submitted 28 June, 2024; originally announced July 2024.

  12. arXiv:2406.19659  [pdf]

    q-bio.NC

    Object Space is Embodied

    Authors: Shan Xu, Xinran Feng, Yuannan Li, Jia Liu

    Abstract: The perceived similarity between objects has often been attributed to their physical and conceptual features, such as appearance and animacy, and the theoretical framework of object space is accordingly conceived. Here, we extend this framework by proposing that object space may also be defined by embodied features, specifically action possibilities that objects afford to an agent (i.e., affordanc…

    Submitted 5 August, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  13. arXiv:2406.14358  [pdf]

    q-bio.NC cs.AI cs.CL

    The neural correlates of logical-mathematical symbol systems processing resemble that of spatial cognition more than natural language processing

    Authors: Yuannan Li, Shan Xu, Jia Liu

    Abstract: The ability to manipulate logical-mathematical symbols (LMS), encompassing tasks such as calculation, reasoning, and programming, is a cognitive skill arguably unique to humans. Considering the relatively recent emergence of this ability in human evolutionary history, it has been suggested that LMS processing may build upon more fundamental cognitive systems, possibly through neuronal recycling. P…

    Submitted 20 June, 2024; originally announced June 2024.

  14. arXiv:2406.14100  [pdf, other]

    q-bio.NC

    Self-Attention in Transformer Networks Explains Monkeys' Gaze Pattern in Pac-Man Game

    Authors: Zhongqiao Lin, Yunwei Li, Tianming Yang

    Abstract: We proactively direct our eyes and attention to collect information during problem solving and decision making. Understanding gaze patterns is crucial for gaining insights into the computation underlying the problem-solving process. However, there is a lack of interpretable models that can account for how the brain directs the eyes to collect information and utilize it, especially in the context o…

    Submitted 20 June, 2024; originally announced June 2024.

  15. arXiv:2406.13284  [pdf]

    physics.med-ph q-bio.QM

    The association of domain-specific physical activity and sedentary activity with stroke: A prospective cohort study

    Authors: Xinyi He, Shidi Wang, Yi Li, Jiucun Wang, Guangrui Yang, Jun Chen, Zixin Hu

    Abstract: Background The incidence of stroke places a heavy burden on both society and individuals. Activity is closely related to cardiovascular health. This study aimed to investigate the relationship between the varying domains of PA, like occupation-related Physical Activity (OPA), transportation-related Physical Activity (TPA), leisure-time Physical Activity (LTPA), and Sedentary Activity (SA) with str…

    Submitted 19 June, 2024; originally announced June 2024.

  16. arXiv:2406.12002  [pdf, other]

    q-bio.PE cs.LG math.NA physics.soc-ph

    Modeling, Inference, and Prediction in Mobility-Based Compartmental Models for Epidemiology

    Authors: Ning Jiang, Weiqi Chu, Yao Li

    Abstract: Classical compartmental models in epidemiology often assume a homogeneous population for simplicity, which neglects the inherent heterogeneity among individuals. This assumption frequently leads to inaccurate predictions when applied to real-world data. For example, evidence has shown that classical models overestimate the final pandemic size in the H1N1-2009 and COVID-19 outbreaks. To address thi…

    Submitted 6 September, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 19 pages, 8 figures

  17. arXiv:2406.06393  [pdf, other]

    cs.CV cs.CL q-bio.GN

    STimage-1K4M: A histopathology image-gene expression dataset for spatial transcriptomics

    Authors: Jiawen Chen, Muqing Zhou, Wenrong Wu, Jinwei Zhang, Yun Li, Didong Li

    Abstract: Recent advances in multi-modal algorithms have driven and been driven by the increasing availability of large image-text datasets, leading to significant strides in various fields, including computational pathology. However, in most existing medical image-text datasets, the text typically provides high-level summaries that may not sufficiently describe sub-tile regions within a large pathology ima…

    Submitted 20 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    ACM Class: I.4.10; I.2.10

  18. arXiv:2406.05170  [pdf]

    q-bio.OT cs.CV eess.IV

    Research on Tumors Segmentation based on Image Enhancement Method

    Authors: Danyi Huang, Ziang Liu, Yizhou Li

    Abstract: One of the most effective ways to treat liver cancer is to perform precise liver resection surgery, the key step of which includes precise digital image segmentation of the liver and its tumor. However, traditional liver parenchymal segmentation techniques often face several challenges in performing liver segmentation: lack of precision, slow processing speed, and computational burden. These short…

    Submitted 7 June, 2024; originally announced June 2024.

  19. arXiv:2405.20702  [pdf, other]

    q-bio.PE physics.soc-ph

    Effect of antibody levels on the spread of disease in multiple infections

    Authors: Xiangxi Li, Yuhan Li, Minyu Feng, Jürgen Kurths

    Abstract: There are complex interactions between antibody levels and epidemic propagation, the antibody level of an individual influences the probability of infection, and the spread of the virus influences the antibody level of each individual. There exist some viruses that, in their natural state, cause antibody levels in an infected individual to gradually decay. When these antibody levels decay to a cer…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 14 pages, 9 figures

  20. arXiv:2405.12144  [pdf]

    q-bio.NC

    Alterations of electrocortical activity during hand movements induced by motor cortex glioma

    Authors: Yihan Wu, Tao Chang, Siliang Chen, Xiaodong Niu, Yu Li, Yuan Fang, Lei Yang, Yixuan Zong, Yaoxin Yang, Yuehua Li, Mengsong Wang, Wen Yang, Yixuan Wu, Chen Fu, Xia Fang, Yuxin Quan, Xilin Peng, Qiang Sun, Marc M. Van Hulle, Yanhui Liu, Ning Jiang, Dario Farina, Yuan Yang, Jiayuan He, Qing Mao

    Abstract: Glioma cells can reshape functional neuronal networks by hijacking neuronal synapses, leading to partial or complete neurological dysfunction. These mechanisms have been previously explored for language functions. However, the impact of glioma on sensorimotor functions is still unknown. Therefore, we recruited a control group of patients with unaffected motor cortex and a group of patients with gl…

    Submitted 20 May, 2024; originally announced May 2024.

  21. arXiv:2405.11769  [pdf, other]

    q-bio.BM cs.LG physics.bio-ph

    Uni-Mol Docking V2: Towards Realistic and Accurate Binding Pose Prediction

    Authors: Eric Alcaide, Zhifeng Gao, Guolin Ke, Yaqi Li, Linfeng Zhang, Hang Zheng, Gengmo Zhou

    Abstract: In recent years, machine learning (ML) methods have emerged as promising alternatives for molecular docking, offering the potential for high accuracy without incurring prohibitive computational costs. However, recent studies have indicated that these ML models may overfit to quantitative metrics while neglecting the physical constraints inherent in the problem. In this work, we present Uni-Mol Doc…

    Submitted 20 May, 2024; originally announced May 2024.

  22. arXiv:2405.09851  [pdf, other]

    eess.IV cs.CV q-bio.QM

    Region of Interest Detection in Melanocytic Skin Tumor Whole Slide Images -- Nevus & Melanoma

    Authors: Yi Cui, Yao Li, Jayson R. Miedema, Sharon N. Edmiston, Sherif Farag, J. S. Marron, Nancy E. Thomas

    Abstract: Automated region of interest detection in histopathological image analysis is a challenging and important topic with tremendous potential impact on clinical practice. The deep-learning methods used in computational pathology may help us to reduce costs and increase the speed and accuracy of cancer diagnosis. We started with the UNC Melanocytic Tumor Dataset cohort that contains 160 hematoxylin and…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 5 figures, NeurIPS 2022 Workshop

  23. arXiv:2405.05665  [pdf, other]

    cs.LG q-bio.QM

    SubGDiff: A Subgraph Diffusion Model to Improve Molecular Representation Learning

    Authors: Jiying Zhang, Zijing Liu, Yu Wang, Yu Li

    Abstract: Molecular representation learning has shown great success in advancing AI-based drug discovery. The core of many recent works is based on the fact that the 3D geometric structure of molecules provides essential information about their physical and chemical characteristics. Recently, denoising diffusion probabilistic models have achieved impressive performance in 3D molecular representation learnin…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 31 pages

  24. arXiv:2405.00753  [pdf, other]

    q-bio.QM cs.AI

    HMAMP: Hypervolume-Driven Multi-Objective Antimicrobial Peptides Design

    Authors: Li Wang, Yiping Li, Xiangzheng Fu, Xiucai Ye, Junfeng Shi, Gary G. Yen, Xiangxiang Zeng

    Abstract: Antimicrobial peptides (AMPs) have exhibited unprecedented potential as biomaterials in combating multidrug-resistant bacteria. Despite the increasing adoption of artificial intelligence for novel AMP design, challenges pertaining to conflicting attributes such as activity, hemolysis, and toxicity have significantly impeded the progress of researchers. This paper introduces a paradigm shift by con…

    Submitted 1 May, 2024; originally announced May 2024.

  25. arXiv:2405.00719  [pdf, other]

    eess.SP cs.LG q-bio.NC

    EEG-Deformer: A Dense Convolutional Transformer for Brain-computer Interfaces

    Authors: Yi Ding, Yong Li, Hao Sun, Rui Liu, Chengxuan Tong, Cuntai Guan

    Abstract: Effectively learning the temporal dynamics in electroencephalogram (EEG) signals is challenging yet essential for decoding brain activities using brain-computer interfaces (BCIs). Although Transformers are popular for their long-term sequential learning ability in the BCI field, most methods combining Transformers with convolutional neural networks (CNNs) fail to capture the coarse-to-fine tempora…

    Submitted 25 April, 2024; originally announced May 2024.

    Comments: 10 pages, 9 figures. This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  26. arXiv:2404.15309  [pdf, other]

    eess.SP cs.LG q-bio.NC

    Sparse Bayesian Correntropy Learning for Robust Muscle Activity Reconstruction from Noisy Brain Recordings

    Authors: Yuanhao Li, Badong Chen, Natsue Yoshimura, Yasuharu Koike, Okito Yamashita

    Abstract: Sparse Bayesian learning has promoted many effective frameworks for brain activity decoding, especially for the reconstruction of muscle activity. However, existing sparse Bayesian learning mainly employs Gaussian distribution as error assumption in the reconstruction task, which is not necessarily the truth in the real-world application. On the other hand, brain recording is known to be highly no…

    Submitted 1 April, 2024; originally announced April 2024.

  27. arXiv:2404.11761  [pdf, other]

    q-bio.MN q-bio.CB

    A computational scheme connecting gene regulatory network dynamics with heterogeneous stem cell regeneration

    Authors: Yakun Li, Xiyin Liang, Jinzhi Lei

    Abstract: Stem cell regeneration is a vital biological process in self-renewing tissues, governing development and tissue homeostasis. Gene regulatory network dynamics are pivotal in controlling stem cell regeneration and cell type transitions. However, integrating the quantitative dynamics of gene regulatory networks at the single-cell level with stem cell regeneration at the population level poses signifi…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 27 pages, 9 figures

  28. arXiv:2404.11199  [pdf, other]

    q-bio.BM

    RiboDiffusion: Tertiary Structure-based RNA Inverse Folding with Generative Diffusion Models

    Authors: Han Huang, Ziqian Lin, Dongchen He, Liang Hong, Yu Li

    Abstract: RNA design shows growing applications in synthetic biology and therapeutics, driven by the crucial role of RNA in various biological processes. A fundamental challenge is to find functional RNA sequences that satisfy given structural constraints, known as the inverse folding problem. Computational approaches have emerged to address this problem based on secondary structures. However, designing RNA…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 15 pages

  29. arXiv:2404.10354  [pdf]

    q-bio.QM cs.CE cs.LG

    Physical formula enhanced multi-task learning for pharmacokinetics prediction

    Authors: Ruifeng Li, Dongzhan Zhou, Ancheng Shen, Ao Zhang, Mao Su, Mingqian Li, Hongyang Chen, Gang Chen, Yin Zhang, Shufei Zhang, Yuqiang Li, Wanli Ouyang

    Abstract: Artificial intelligence (AI) technology has demonstrated remarkable potential in drug discovery, where pharmacokinetics plays a crucial role in determining the dosage, safety, and efficacy of new drugs. A major challenge for AI-driven drug discovery (AIDD) is the scarcity of high-quality data, which often requires extensive wet-lab work. A typical example of this is pharmacokinetic experiments. I…

    Submitted 16 April, 2024; originally announced April 2024.

  30. arXiv:2404.09837  [pdf, other]

    math.AP q-bio.PE

    On inverse problems in multi-population aggregation models

    Authors: Yuhan Li, Hongyu Liu, Catharine W. K. Lo

    Abstract: This paper focuses on inverse problems arising in studying multi-population aggregations. The goal is to reconstruct the diffusion coefficient, advection coefficient, and interaction kernels of the aggregation system, which characterize the dynamics of different populations. In the theoretical analysis of the physical setup, it is crucial to ensure non-negativity of solutions. To address this, we…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 29 pages, Keywords: inverse multi-population aggregation model, positive solutions, unique identifiability, transformative asymptotic technique, high-order variation method

    MSC Class: 35R30; 35B09; 35K45; 35Q92; 92-10; 92D25; 92D50; 35B10; 35C20

  31. arXiv:2404.09738  [pdf]

    q-bio.BM cs.AI q-bio.QM

    AMPCliff: quantitative definition and benchmarking of activity cliffs in antimicrobial peptides

    Authors: Kewei Li, Yuqian Wu, Yutong Guo, Yinheng Li, Yusi Fan, Ruochi Zhang, Lan Huang, Fengfeng Zhou

    Abstract: Activity cliff (AC) is a phenomenon that a pair of similar molecules differ by a small structural alternation but exhibit a large difference in their biochemical activities. The AC of small molecules has been extensively investigated but limited knowledge is accumulated about the AC phenomenon in peptides with canonical amino acids. This study introduces a quantitative definition and benchmarking…

    Submitted 15 April, 2024; originally announced April 2024.

  32. arXiv:2404.08713  [pdf, other]

    eess.IV cs.LG q-bio.QM

    Survival Prediction Across Diverse Cancer Types Using Neural Networks

    Authors: Xu Yan, Weimin Wang, MingXuan Xiao, Yufeng Li, Min Gao

    Abstract: Gastric cancer and Colon adenocarcinoma represent widespread and challenging malignancies with high mortality rates and complex treatment landscapes. In response to the critical need for accurate prognosis in cancer patients, the medical community has embraced the 5-year survival rate as a vital metric for estimating patient outcomes. This study introduces a pioneering approach to enhance survival…

    Submitted 11 April, 2024; originally announced April 2024.

  33. arXiv:2404.04604  [pdf, other]

    q-bio.NC

    A diffusion MRI tractography atlas for concurrent white matter mapping across Eastern and Western populations

    Authors: Yijie Li, Wei Zhang, Ye Wu, Li Yin, Ce Zhu, Yuqian Chen, Suheyla Cetin-Karayumak, Kang Ik K Cho, Leo R. Zekelman, Jarrett Rushmore, Yogesh Rathi, Nikos Makris, Lauren J. O'Donnell, Fan Zhang

    Abstract: The study of brain differences across Eastern and Western populations provides vital insights for understanding potential cultural and genetic influences on cognition and mental health. Diffusion MRI (dMRI) tractography is an important tool in assessing white matter (WM) connectivity and brain tissue microstructure across different populations. However, a comprehensive investigation into WM fiber…

    Submitted 6 April, 2024; originally announced April 2024.

  34. arXiv:2403.17299  [pdf, other]

    cs.CL q-bio.NC

    Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models using Minimal Pairs

    Authors: Linyang He, Peili Chen, Ercong Nie, Yuanning Li, Jonathan R. Brennan

    Abstract: Inspired by cognitive neuroscience studies, we introduce a novel `decoding probing' method that uses minimal pairs benchmark (BLiMP) to probe internal linguistic characteristics in neural language models layer by layer. By treating the language model as the `brain' and its representations as `neural activations', we decode grammaticality labels of minimal pairs from the intermediate layers' repres…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by LREC-COLING 2024

  35. arXiv:2403.11516   

    q-bio.NC

    Perceptual learning in contour detection transfer across changes in contour path and orientation

    Authors: Yue Ding, Hongqiao Shi, Shuang Song, Yonghui Wang, Ya Li

    Abstract: The integration of local elements into shape contours is critical for target detection and identification in cluttered scenes. Previous studies have shown that observers can learn to use image regularities for contour integration and target identification. However, we still know little about the generalization of perceptual learning in contour integration. Specifically, whether training in contour…

    Submitted 20 August, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Following the submission of our work, we have discovered that our research is not yet complete and that some important new results have emerged. We believe that incorporating these new findings into our manuscript will significantly strengthen our work and improve its overall impact. Therefore, we have decided to withdraw the current version and revise the manuscript accordingly

  36. arXiv:2403.08192  [pdf, other]

    cs.CL q-bio.BM

    MoleculeQA: A Dataset to Evaluate Factual Accuracy in Molecular Comprehension

    Authors: Xingyu Lu, He Cao, Zijing Liu, Shengyuan Bai, Leqing Chen, Yuan Yao, Hai-Tao Zheng, Yu Li

    Abstract: Large language models are playing an increasingly significant role in molecular research, yet existing models often generate erroneous information, posing challenges to accurate molecular comprehension. Traditional evaluation metrics for generated content fail to assess a model's accuracy in molecular understanding. To rectify the absence of factual evaluation, we present MoleculeQA, a novel quest…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 19 pages, 8 figures

  37. arXiv:2402.18784  [pdf, other]

    cs.AI q-bio.NC

    Brain-inspired and Self-based Artificial Intelligence

    Authors: Yi Zeng, Feifei Zhao, Yuxuan Zhao, Dongcheng Zhao, Enmeng Lu, Qian Zhang, Yuwei Wang, Hui Feng, Zhuoya Zhao, Jihang Wang, Qingqun Kong, Yinqian Sun, Yang Li, Guobin Shen, Bing Han, Yiting Dong, Wenxuan Pan, Xiang He, Aorigele Bao, Jin Wang

    Abstract: The question "Can machines think?" and the Turing Test to assess whether machines could achieve human-level intelligence is one of the roots of AI. With the philosophical argument "I think, therefore I am", this paper challenge the idea of a "thinking machine" supported by current AIs since there is no sense of self in them. Current artificial intelligence is only seemingly intelligent information…

    Submitted 28 February, 2024; originally announced February 2024.

  38. arXiv:2402.17156  [pdf, other]

    cs.LG cs.AI q-bio.BM

    TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation

    Authors: Lin Zongying, Li Hao, Lv Liuzhenghao, Lin Bin, Zhang Junwu, Chen Calvin Yu-Chian, Yuan Li, Tian Yonghong

    Abstract: Designing protein sequences with specific biological functions and structural stability is crucial in biology and chemistry. Generative models already demonstrated their capabilities for reliable protein design. However, previous models are limited to the unconditional generation of protein sequences and lack the controllable generation ability that is vital to biological tasks. In this work, we p…

    Submitted 26 February, 2024; originally announced February 2024.

  39. arXiv:2402.15515  [pdf]

    cs.AI q-bio.QM stat.AP

    Feasibility of Identifying Factors Related to Alzheimer's Disease and Related Dementia in Real-World Data

    Authors: Aokun Chen, Qian Li, Yu Huang, Yongqiu Li, Yu-neng Chuang, Xia Hu, Serena Guo, Yonghui Wu, Yi Guo, Jiang Bian

    Abstract: A comprehensive view of factors associated with AD/ADRD will significantly aid in studies to develop new treatments for AD/ADRD and identify high-risk populations and patients for prevention efforts. In our study, we summarized the risk factors for AD/ADRD by reviewing existing meta-analyses and review articles on risk and preventive factors for AD/ADRD. In total, we extracted 477 risk factors in…

    Submitted 3 February, 2024; originally announced February 2024.

  40. arXiv:2402.12391  [pdf, other]

    q-bio.GN cs.AI cs.LG

    Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data

    Authors: Haoyang Liu, Yijiang Li, Jinglin Jian, Yuxuan Cheng, Jianrong Lu, Shuyi Guo, Jinglei Zhu, Mianchen Zhang, Miantong Zhang, Haohan Wang

    Abstract: Machine learning has emerged as a powerful tool for scientific discovery, enabling researchers to extract meaningful insights from complex datasets. For instance, it has facilitated the identification of disease-predictive genes from gene expression data, significantly advancing healthcare. However, the traditional process for analyzing such datasets demands substantial human effort and expertise…

    Submitted 20 February, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 18 pages, 2 figures; added contact

  41. arXiv:2402.08703  [pdf, other]

    q-bio.BM cs.AI cs.LG

    A Survey of Generative AI for de novo Drug Design: New Frontiers in Molecule and Protein Generation

    Authors: Xiangru Tang, Howard Dai, Elizabeth Knight, Fang Wu, Yunyang Li, Tianxiao Li, Mark Gerstein

    Abstract: Artificial intelligence (AI)-driven methods can vastly improve the historically costly drug design process, with various generative models already in widespread use. Generative models for de novo drug design, in particular, focus on the creation of novel biological compounds entirely from scratch, representing a promising future direction. Rapid development in the field, combined with the inherent…

    Submitted 26 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

  42. arXiv:2402.04286  [pdf]

    q-bio.QM cs.AI cs.LG

    Progress and Opportunities of Foundation Models in Bioinformatics

    Authors: Qing Li, Zhihang Hu, Yixuan Wang, Lei Li, Yimin Fan, Irwin King, Le Song, Yu Li

    Abstract: Bioinformatics has witnessed a paradigm shift with the increasing integration of artificial intelligence (AI), particularly through the adoption of foundation models (FMs). These AI techniques have rapidly advanced, addressing historical challenges in bioinformatics such as the scarcity of annotated data and the presence of data noise. FMs are particularly adept at handling large-scale, unlabeled…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 27 pages, 3 figures, 2 tables

    MSC Class: cs.CL; 92-02 ACM Class: I.2.1

  43. arXiv:2402.01481  [pdf, other]

    cs.LG cs.AI q-bio.BM

    Pre-Training Protein Bi-level Representation Through Span Mask Strategy On 3D Protein Chains

    Authors: Jiale Zhao, Wanru Zhuang, Jia Song, Yaqi Li, Shuqi Lu

    Abstract: In recent years, there has been a surge in the development of 3D structure-based pre-trained protein models, representing a significant advancement over pre-trained protein language models in various downstream tasks. However, most existing structure-based pre-trained models primarily focus on the residue level, i.e., alpha carbon atoms, while ignoring other atoms like side chain atoms. We argue t…

    Submitted 2 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  44. arXiv:2401.17671  [pdf, other]

    cs.CL cs.AI q-bio.NC

    Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

    Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

    Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs,…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures and 4 supplementary figures

  45. arXiv:2401.15122  [pdf, other]

    cs.LG cs.AI q-bio.BM q-bio.QM stat.ML

    A Multi-Grained Symmetric Differential Equation Model for Learning Protein-Ligand Binding Dynamics

    Authors: Shengchao Liu, Weitao Du, Yanjing Li, Zhuoxinran Li, Vignesh Bhethanabotla, Nakul Rampal, Omar Yaghi, Christian Borgs, Anima Anandkumar, Hongyu Guo, Jennifer Chayes

    Abstract: In drug discovery, molecular dynamics (MD) simulation for protein-ligand binding provides a powerful tool for predicting binding affinities, estimating transport properties, and exploring pocket sites. There has been a long history of improving the efficiency of MD simulations through better numerical methods and, more recently, by utilizing machine learning (ML) methods. Yet, challenges remain, s…

    Submitted 1 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

  46. arXiv:2401.14104  [pdf]

    q-bio.BM

    Label-free detection of exosomes from different cellular sources based on surface-enhanced Raman spectroscopy combined with machine learning models

    Authors: Yang Li, Xiaoming Lyu, Kuo Zhan, Haoyu Ji, Lei Qin, JianAn Huang

    Abstract: Exosomes are significant facilitators of inter-cellular communication that can unveil cell-cell interactions, signaling pathways, regulatory mechanisms and disease diagnostics. Nonetheless, current analysis requires a large amount of data for exosome identification, which hampers efficient and timely mechanism study and diagnostics. Here, we used a machine-learning assisted Surface-enhanced Raman s…

    Submitted 26 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 5 figures

  47. arXiv:2401.11447  [pdf, other]

    cs.LG q-bio.QM

    Sequential Model for Predicting Patient Adherence in Subcutaneous Immunotherapy for Allergic Rhinitis

    Authors: Yin Li, Yu Xiong, Wenxin Fan, Kai Wang, Qingqing Yu, Liping Si, Patrick van der Smagt, Jun Tang, Nutan Chen

    Abstract: Objective: Subcutaneous Immunotherapy (SCIT) is the long-lasting causal treatment of allergic rhinitis (AR). How to enhance the adherence of patients to maximize the benefit of allergen immunotherapy (AIT) plays a crucial role in the management of AIT. This study aims to leverage novel machine learning models to precisely predict the risk of non-adherence of AR patients and related local symptom s…

    Submitted 19 July, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Frontiers in Pharmacology, research topic: Methods and Metrics to Measure Medication Adherence

  48. arXiv:2401.10806  [pdf, ps, other]

    q-bio.BM

    DeepRLI: A Multi-objective Framework for Universal Protein--Ligand Interaction Prediction

    Authors: Haoyu Lin, Shiwei Wang, Jintao Zhu, Yibo Li, Jianfeng Pei, Luhua Lai

    Abstract: Protein (receptor)--ligand interaction prediction is a critical component in computer-aided drug design, significantly influencing molecular docking and virtual screening processes. Despite the development of numerous scoring functions in recent years, particularly those employing machine learning, accurately and efficiently predicting binding affinities for protein--ligand complexes remains a for…

    Submitted 19 January, 2024; originally announced January 2024.

  49. arXiv:2401.10144  [pdf, other]

    q-bio.BM cs.LG

    Exploiting Hierarchical Interactions for Protein Surface Learning

    Authors: Yiqun Lin, Liang Pan, Yi Li, Ziwei Liu, Xiaomeng Li

    Abstract: Predicting interactions between proteins is one of the most important yet challenging problems in structural bioinformatics. Intrinsically, potential function sites in protein surfaces are determined by both geometric and chemical features. However, existing works only consider handcrafted or individually learned chemical features from the atom type and extract geometric features independently. He…

    Submitted 17 January, 2024; originally announced January 2024.

    Comments: Accepted to J-BHI

  50. On inverse problems in predator-prey models

    Authors: Yuhan Li, Hongyu Liu, Catharine W. K. Lo

    Abstract: In this paper, we consider the inverse problem of determining the coefficients of interaction terms within some Lotka-Volterra models, with support from boundary observation of its non-negative solutions. In the physical background, the solutions to the predator-prey model stand for the population densities for predator and prey and are non-negative, which is a critical challenge in our inverse pr…

    Submitted 15 December, 2023; originally announced December 2023.

    MSC Class: 35R30; 35B09; 35K51; 35Q92; 92-10; 92D25; 35K58

    Journal ref: Journal of Differential Equations Volume 397, 15 July 2024, Pages 349-376