Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 1,308 results for author: Ma, W

.
  1. arXiv:2408.07926  [pdf, other

    eess.SY

    Enhanced Equivalent Circuit Model for High Current Discharge of Lithium-Ion Batteries with Application to Electric Vertical Takeoff and Landing Aircraft

    Authors: Alireza Goshtasbi, Ruxiu Zhao, Ruiting Wang, Sangwoo Han, Wenting Ma, Jeremy Neubauer

    Abstract: Conventional battery equivalent circuit models (ECMs) have limited capability to predict performance at high discharge rates, where lithium depleted regions may develop and cause a sudden exponential drop in the cell's terminal voltage. Having accurate predictions of performance under such conditions is necessary for electric vertical takeoff and landing (eVTOL) aircraft applications, where high d… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2408.07641  [pdf, other

    hep-ex

    Exploring New Physics with PandaX-4T Low Energy Electronic Recoil Data

    Authors: PandaX Collaboration, Xinning Zeng, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke HanChangda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (76 additional authors not shown)

    Abstract: New particles beyond the Standard Model of particle physics, such as axions, can be effectively searched through their interactions with electrons. We use the large liquid xenon detector PandaX-4T to search for novel electronic recoil signals induced by solar axions, neutrinos with anomalous magnetic moment, axion-like particles, dark photons, and light fermionic dark matter. A detailed background… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  3. arXiv:2408.04568  [pdf, other

    cs.CL cs.AI

    Learning Fine-Grained Grounded Citations for Attributed Large Language Models

    Authors: Lei Huang, Xiaocheng Feng, Weitao Ma, Yuxuan Gu, Weihong Zhong, Xiachong Feng, Weijiang Yu, Weihua Peng, Duyu Tang, Dandan Tu, Bing Qin

    Abstract: Despite the impressive performance on information-seeking tasks, large language models (LLMs) still struggle with hallucinations. Attributed LLMs, which augment generated text with in-line citations, have shown potential in mitigating hallucinations and improving verifiability. However, current approaches suffer from suboptimal citation quality due to their reliance on in-context learning. Further… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: Accepted by ACL 2024 Findings

  4. arXiv:2408.04392  [pdf, other

    cs.CL

    Open-domain Implicit Format Control for Large Language Model Generation

    Authors: Yiqun Yao, Wenjia Ma, Xuezhi Fang, Xin Jiang, Xiang Li, Xuying Meng, Peng Han, Jing Li, Aixin Sun, Yequan Wang

    Abstract: Controlling the format of outputs generated by large language models (LLMs) is a critical functionality in various applications. Current methods typically employ constrained decoding with rule-based automata or fine-tuning with manually crafted format instructions, both of which struggle with open-domain format requirements. To address this limitation, we introduce a novel framework for controlled… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 6 pages

  5. arXiv:2408.01697  [pdf, other

    cs.LG cs.AI stat.ML

    Invariant Graph Learning Meets Information Bottleneck for Out-of-Distribution Generalization

    Authors: Wenyu Mao, Jiancan Wu, Haoyang Liu, Yongduo Sui, Xiang Wang

    Abstract: Graph out-of-distribution (OOD) generalization remains a major challenge in graph learning since graph neural networks (GNNs) often suffer from severe performance degradation under distribution shifts. Invariant learning, aiming to extract invariant features across varied distributions, has recently emerged as a promising approach for OOD generation. Despite the great success of invariant learning… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

  6. arXiv:2408.01366  [pdf, other

    cs.RO cs.CV

    Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation

    Authors: Ruoxuan Feng, Di Hu, Wenke Ma, Xuelong Li

    Abstract: Humans possess a remarkable talent for flexibly alternating to different senses when interacting with the environment. Picture a chef skillfully gauging the timing of ingredient additions and controlling the heat according to the colors, sounds, and aromas, seamlessly navigating through every stage of the complex cooking process. This ability is founded upon a thorough comprehension of task stages… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  7. arXiv:2408.00754  [pdf, other

    cs.CV cs.LG

    Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

    Authors: Benlin Liu, Yuhao Dong, Yiqin Wang, Yongming Rao, Yansong Tang, Wei-Chiu Ma, Ranjay Krishna

    Abstract: Multimodal language models (MLLMs) are increasingly being implemented in real-world environments, necessitating their ability to interpret 3D spaces and comprehend temporal dynamics. Despite their potential, current top models within our community still fall short in adequately understanding spatial and temporal dimensions. We introduce Coarse Correspondence, a simple, training-free, effective, an… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: project page: https://coarse-correspondence.github.io

  8. arXiv:2408.00664  [pdf, other

    hep-ex

    Dark Matter Search Results from 1.54 Tonne$\cdot$Year Exposure of PandaX-4T

    Authors: PandaX Collaboration, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (77 additional authors not shown)

    Abstract: In this letter, we report the dark matter search results from the commissioning run and the first science run of the PandaX-4T experiment. A blind analysis is carried out on the entire data set. The data processing is improved compared to previous work, unifying the low-level signal reconstruction in a wide energy range up to 120 keV. With a total exposure of 1.54 tonne$\cdot$year, no significant… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

  9. arXiv:2408.00058  [pdf, ps, other

    cond-mat.mes-hall cond-mat.str-el gr-qc

    Information Scrambling at Quantum Hall Interfaces and Their Analog to Black Hole Event Horizon

    Authors: Ken K. W. Ma, Kun Yang

    Abstract: The black hole information paradox has been hotly debated for the last few decades without a full resolution. This makes it desirable to find analogues of this paradox in simple and experimentally accessible systems, whose resolutions may shed light on this longstanding and fundamental problem. Here, we review and resolve the apparent "information paradox" in two different interfaces separating Ab… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: 44 pages, 5 figures. An invited book chapter for "The Black Hole Information Paradox", to be published by Springer Singapore in 2025. arXiv admin note: substantial text overlap with arXiv:2209.11119, arXiv:2106.11306

  10. arXiv:2407.21709  [pdf

    physics.plasm-ph

    Influences of $δ$B Contribution and Parallel Inertial Term of Energetic Particles on MHD-Kinetic Hybrid Simulations: A Case Study of the 1/1 Internal Kink Mode

    Authors: H. X. Zhang, H. W. Zhang, Z. W. Ma, C. Liu

    Abstract: The magnetohydrodynamic-kinetic (MHD-kinetic) hybrid model [Park et. al., 1992] has been widely applied in studying energetic particles (EPs) problems in fusion plasmas for past decades. The pressure-coupling scheme or the current-coupling scheme is adopted in this model. However, two noteworthy issues arise in the model application: firstly, the coupled term introduced in the pressure-coupling sc… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 31 pages, 16 figures, 2 tables

  11. arXiv:2407.21683  [pdf, other

    physics.geo-ph

    Generative Diffusion Model for Seismic Imaging Improvement of Sparsely Acquired Data and Uncertainty Quantification

    Authors: Xingchen Shi, Shijun Cheng, Weijian Mao, Wei Ouyang

    Abstract: Seismic imaging from sparsely acquired data faces challenges such as low image quality, discontinuities, and migration swing artifacts. Existing convolutional neural network (CNN)-based methods struggle with complex feature distributions and cannot effectively assess uncertainty, making it hard to evaluate the reliability of their processed results. To address these issues, we propose a new method… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  12. arXiv:2407.20679  [pdf, other

    cs.CE

    Online Prediction-Assisted Safe Reinforcement Learning for Electric Vehicle Charging Station Recommendation in Dynamically Coupled Transportation-Power Systems

    Authors: Qionghua Liao, Guilong Li, Jiajie Yu, Ziyuan Gu, Wei Ma

    Abstract: With the proliferation of electric vehicles (EVs), the transportation network and power grid become increasingly interdependent and coupled via charging stations. The concomitant growth in charging demand has posed challenges for both networks, highlighting the importance of charging coordination. Existing literature largely overlooks the interactions between power grid security and traffic effici… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 33 pages, 31 figures

  13. arXiv:2407.20419  [pdf, other

    cs.DS

    Randomized Rounding Approaches to Online Allocation, Sequencing, and Matching

    Authors: Will Ma

    Abstract: Randomized rounding is a technique that was originally used to approximate hard offline discrete optimization problems from a mathematical programming relaxation. Since then it has also been used to approximately solve sequential stochastic optimization problems, overcoming the curse of dimensionality. To elaborate, one first writes a tractable linear programming relaxation that prescribes probabi… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: To appear as an INFORMS 2024 Tutorial

  14. Channel-Aware Low-Rank Adaptation in Time Series Forecasting

    Authors: Tong Nie, Yuewen Mei, Guoyang Qin, Jian Sun, Wei Ma

    Abstract: The balance between model capacity and generalization has been a key focus of recent discussions in long-term time series forecasting. Two representative channel strategies are closely associated with model expressivity and robustness, including channel independence (CI) and channel dependence (CD). The former adopts individual channel treatment and has been shown to be more robust to distribution… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: Accepted by CIKM 2024, short research paper track

  15. arXiv:2407.17115  [pdf, other

    cs.IR

    Reinforced Prompt Personalization for Recommendation with Large Language Models

    Authors: Wenyu Mao, Jiancan Wu, Weijian Chen, Chongming Gao, Xiang Wang, Xiangnan He

    Abstract: Designing effective prompts can empower LLMs to understand user preferences and provide recommendations by leveraging LLMs' intent comprehension and knowledge utilization capabilities. However, existing research predominantly concentrates on task-wise prompting, developing fixed prompt templates composed of four patterns (i.e., role-playing, history records, reasoning guidance, and output format)… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  16. arXiv:2407.15645  [pdf, other

    cs.CL cs.AI

    Psychometric Alignment: Capturing Human Knowledge Distributions via Language Models

    Authors: Joy He-Yueya, Wanjing Anya Ma, Kanishk Gandhi, Benjamin W. Domingue, Emma Brunskill, Noah D. Goodman

    Abstract: Language models (LMs) are increasingly used to simulate human-like responses in scenarios where accurately mimicking a population's behavior can guide decision-making, such as in developing educational materials and designing public policies. The objective of these simulations is for LMs to capture the variations in human responses, rather than merely providing the expected correct answers. Prior… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Code and data: https://github.com/joyheyueya/psychometric-alignment

  17. arXiv:2407.14899  [pdf, other

    cs.LG cs.CV

    Hyperspectral Unmixing Under Endmember Variability: A Variational Inference Framework

    Authors: Yuening Li, Xiao Fu, Junbin Liu, Wing-Kin Ma

    Abstract: This work proposes a variational inference (VI) framework for hyperspectral unmixing in the presence of endmember variability (HU-EV). An EV-accounted noisy linear mixture model (LMM) is considered, and the presence of outliers is also incorporated into the model. Following the marginalized maximum likelihood (MML) principle, a VI algorithmic structure is designed for probabilistic inference for H… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  18. arXiv:2407.13094  [pdf, other

    cs.CV

    Rethinking Video-Text Understanding: Retrieval from Counterfactually Augmented Data

    Authors: Wufei Ma, Kai Li, Zhongshi Jiang, Moustafa Meshry, Qihao Liu, Huiyu Wang, Christian Häne, Alan Yuille

    Abstract: Recent video-text foundation models have demonstrated strong performance on a wide variety of downstream video understanding tasks. Can these video-text models genuinely understand the contents of natural videos? Standard video-text evaluations could be misleading as many questions can be inferred merely from the objects and contexts in a single frame or biases inherent in the datasets. In this pa… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV 2024. Project page: https://feint6k.github.io

  19. arXiv:2407.12791  [pdf, other

    cs.CL cs.AI

    TourLLM: Enhancing LLMs with Tourism Knowledge

    Authors: Qikai Wei, Mingzhi Yang, Jinqiang Wang, Wenwei Mao, Jiabo Xu, Huansheng Ning

    Abstract: Recently, large language models (LLMs) have demonstrated their effectiveness in various natural language processing (NLP) tasks. However, the lack of tourism knowledge limits the performance of LLMs in tourist attraction presentations and travel planning. To address this challenge, we constructed a supervised fine-tuning dataset for the culture and tourism domain, named Cultour. This dataset consi… ▽ More

    Submitted 18 June, 2024; originally announced July 2024.

  20. arXiv:2407.12060  [pdf

    q-bio.QM q-bio.CB q-bio.PE

    Decoding Dmrt1: Insights into vertebrate sex determination and gonadal sex differentiation

    Authors: Barbora Augstenová, Wen-Juan Ma

    Abstract: Dmrt1 is pivotal in testis formation and function by interacting with genes crucial for Sertoli cell differentiation, such as Sox9. It represses female-determining pathways and ovarian formation by silencing Foxl2. Across 127 vertebrate species, Dmrt1 exhibits sexually dimorphic expression, prior to and during gonadal sex differentiation and in adult testes, implicating its role in master regulati… ▽ More

    Submitted 8 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 40 pages, 4 figures, 2 tables, 1 supplementary table

  21. arXiv:2407.11266  [pdf, other

    cs.CV

    Towards High-Quality 3D Motion Transfer with Realistic Apparel Animation

    Authors: Rong Wang, Wei Mao, Changsheng Lu, Hongdong Li

    Abstract: Animating stylized characters to match a reference motion sequence is a highly demanded task in film and gaming industries. Existing methods mostly focus on rigid deformations of characters' body, neglecting local deformations on the apparel driven by physical dynamics. They deform apparel the same way as the body, leading to results with limited details and unrealistic artifacts, e.g. body-appare… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  22. arXiv:2407.11086  [pdf, other

    cs.LG cs.AI physics.chem-ph

    Pre-training with Fractional Denoising to Enhance Molecular Property Prediction

    Authors: Yuyan Ni, Shikun Feng, Xin Hong, Yuancheng Sun, Wei-Ying Ma, Zhi-Ming Ma, Qiwei Ye, Yanyan Lan

    Abstract: Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. While many existing methods utilize common pre-training tasks in computer vision (CV) and natural language processing (NLP), they often overlook… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  23. arXiv:2407.11079  [pdf, ps, other

    eess.SP cs.IT

    One-Bit MIMO Detection: From Global Maximum-Likelihood Detector to Amplitude Retrieval Approach

    Authors: Mingjie Shao, Wei-Kun Chen, Cheng-Yang Yu, Ya-Feng Liu, Wing-Kin Ma

    Abstract: As communication systems advance towards the future 6G era, the incorporation of large-scale antenna arrays in base stations (BSs) presents challenges such as increased hardware costs and energy consumption. To address these issues, the use of one-bit analog-to-digital converters (ADCs)/digital-to-analog converters (DACs) has gained significant attentions. This paper focuses on one-bit multiple-in… ▽ More

    Submitted 16 July, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

  24. arXiv:2407.10892  [pdf, other

    hep-ex astro-ph.SR nucl-ex

    First Measurement of Solar $^8$B Neutrino Flux through Coherent Elastic Neutrino-Nucleus Scattering in PandaX-4T

    Authors: PandaX Collaboration, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou, Xiangdong Ji , et al. (77 additional authors not shown)

    Abstract: The PandaX-4T liquid xenon detector at the China Jinping Underground Laboratory is used to measure the solar $^8$B neutrino flux by detecting neutrinos through coherent scattering with xenon nuclei. Data samples requiring the coincidence of scintillation and ionization signals (paired), as well as unpaired ionization-only signals (US2), are selected with energy threshold of approximately 1.1 keV (… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  25. arXiv:2407.05001  [pdf, other

    stat.ME

    Treatment effect estimation under covariate-adaptive randomization with heavy-tailed outcomes

    Authors: Hongzi Li, Wei Ma, Yingying Ma, Hanzhong Liu

    Abstract: Randomized experiments are the gold standard for investigating causal relationships, with comparisons of potential outcomes under different treatment groups used to estimate treatment effects. However, outcomes with heavy-tailed distributions pose significant challenges to traditional statistical approaches. While recent studies have explored these issues under simple randomization, their applicat… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

  26. arXiv:2407.02788  [pdf, other

    quant-ph physics.acc-ph physics.optics

    Generalized Gouy Rotation of Electron Vortex beams in uniform magnetic fields

    Authors: Qi Meng, Xuan Liu, Wei Ma, Zhen Yang, Liang Lu, Alexander J. Silenko, Pengming Zhang, Liping Zou

    Abstract: The rotation of electron vortex beams (EVBs) presents a complex interplay of the Gouy phase characterizing free-space behavior and Landau states or Larmor rotation observed in magnetic fields. Despite being studied separately, these phenomena manifest within a single beam during its propagation in magnetic fields, lacking a comprehensive description. We address this by utilizing exact solutions of… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  27. arXiv:2407.01220  [pdf, other

    cs.CV

    Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation

    Authors: Zihan Gao, Lingling Li, Licheng Jiao, Fang Liu, Xu Liu, Wenping Ma, Yuwei Guo, Shuyuan Yang

    Abstract: Understanding 3D scenes is a crucial challenge in computer vision research with applications spanning multiple domains. Recent advancements in distilling 2D vision-language foundation models into neural fields, like NeRF and 3DGS, enables open-vocabulary segmentation of 3D scenes from 2D multi-view images without the need for precise 3D annotations. While effective, however, the per-pixel distilla… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 16 pages, 7 figures

  28. arXiv:2407.00569  [pdf, other

    cs.CV cs.AI cs.CL

    Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models

    Authors: Weihong Zhong, Xiaocheng Feng, Liang Zhao, Qiming Li, Lei Huang, Yuxuan Gu, Weitao Ma, Yuan Xu, Bing Qin

    Abstract: Though advanced in understanding visual information with human languages, Large Vision-Language Models (LVLMs) still suffer from multimodal hallucinations. A natural concern is that during multimodal interaction, the generated hallucinations could influence the LVLMs' subsequent generation. Thus, we raise a question: When presented with a query relevant to the previously generated hallucination, w… ▽ More

    Submitted 3 August, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Main Conference. 21 pages, 20 figures

  29. arXiv:2406.18864  [pdf, other

    cs.CV

    Learning Modality Knowledge Alignment for Cross-Modality Transfer

    Authors: Wenxuan Ma, Shuang Li, Lincan Cai, Jingxuan Kang

    Abstract: Cross-modality transfer aims to leverage large pretrained models to complete tasks that may not belong to the modality of pretraining data. Existing works achieve certain success in extending classical finetuning to cross-modal scenarios, yet we still lack understanding about the influence of modality gap on the transfer. In this work, a series of experiments focusing on the source representation… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: ICML 2024

  30. arXiv:2406.18599  [pdf, other

    physics.ins-det nucl-ex nucl-th

    Fudan Multi-purpose Active TArget Time Projection Chamber (fMeta-TPC) for Photonnuclear Reaction Experiments

    Authors: Huang-Kai Wu, Xi-Yang Wang, Yu-Miao Wang, You-Jing Wang, De-Qing Fang, Wan-Bing He, Wei-Hu Ma, Xi-Guang Cao, Chang-Bo Fu, Xian-Gai Deng, Yu-Gang Ma

    Abstract: Active Target Time Projection Chambers (AT-TPCs) are state-of-the-art tools in the field of low-energy nuclear physics, particularly suitable for experiments using low-intensity radioactive ion beams or gamma rays. The Fudan Multi-purpose Active Target Time Projection Chamber (fMeta-TPC) with 2048 channels has been developed to study $α$-clustering nuclei. {\fcb In this work, the focus is on the s… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures

  31. arXiv:2406.18115  [pdf, other

    cs.RO cs.AI cs.CV

    Open-vocabulary Mobile Manipulation in Unseen Dynamic Environments with 3D Semantic Maps

    Authors: Dicong Qiu, Wenzong Ma, Zhenfu Pan, Hui Xiong, Junwei Liang

    Abstract: Open-Vocabulary Mobile Manipulation (OVMM) is a crucial capability for autonomous robots, especially when faced with the challenges posed by unknown and dynamic environments. This task requires robots to explore and build a semantic understanding of their surroundings, generate feasible plans to achieve manipulation goals, adapt to environmental changes, and comprehend natural language instruction… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Open-vocabulary, Mobile Manipulation, Dynamic Environments, 3D Semantic Maps, Zero-shot, LLMs, VLMs, 18 pages, 2 figures

  32. arXiv:2406.17797  [pdf, other

    physics.chem-ph cs.AI cs.LG

    MoleculeCLA: Rethinking Molecular Benchmark via Computational Ligand-Target Binding Analysis

    Authors: Shikun Feng, Jiaxin Zheng, Yinjun Jia, Yanwen Huang, Fengfeng Zhou, Wei-Ying Ma, Yanyan Lan

    Abstract: Molecular representation learning is pivotal for various molecular property prediction tasks related to drug discovery. Robust and accurate benchmarks are essential for refining and validating current methods. Existing molecular property benchmarks derived from wet experiments, however, face limitations such as data volume constraints, unbalanced label distribution, and noisy labels. To address th… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  33. arXiv:2406.15796  [pdf, other

    cs.CL

    Rethinking Entity-level Unlearning for Large Language Models

    Authors: Weitao Ma, Xiaocheng Feng, Weihong Zhong, Lei Huang, Yangfan Ye, Bing Qin

    Abstract: Large language model unlearning has gained increasing attention due to its potential to mitigate security and privacy concerns. Current research predominantly focuses on Instance-level unlearning, specifically aiming at forgetting predefined instances of sensitive content. However, a notable gap still exists in exploring the deletion of complete entity-related information, which is crucial in many… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Work in progress

  34. arXiv:2406.14884  [pdf, other

    cs.CL

    FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents

    Authors: Ruixuan Xiao, Wentao Ma, Ke Wang, Yuchuan Wu, Junbo Zhao, Haobo Wang, Fei Huang, Yongbin Li

    Abstract: LLM-based agents have emerged as promising tools, which are crafted to fulfill complex tasks by iterative planning and action. However, these agents are susceptible to undesired planning hallucinations when lacking specific knowledge for expertise-intensive tasks. To address this, preliminary attempts are made to enhance planning reliability by incorporating external workflow-related knowledge. De… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  35. arXiv:2406.14506  [pdf, ps, other

    cs.DS cs.DM math.CO

    Online Matching and Contention Resolution for Edge Arrivals with Vanishing Probabilities

    Authors: Will Ma, Calum MacRury, Pranav Nuti

    Abstract: We study the performance of sequential contention resolution and matching algorithms on random graphs with vanishing edge probabilities. When the edges of the graph are processed in an adversarially-chosen order, we derive a new OCRS that is $0.382$-selectable, attaining the "independence benchmark" from the literature under the vanishing edge probabilities assumption. Complementary to this positi… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    ACM Class: F.2.2; G.2.2

    Journal ref: In EC 2024

  36. Unifying Graph Convolution and Contrastive Learning in Collaborative Filtering

    Authors: Yihong Wu, Le Zhang, Fengran Mo, Tianyu Zhu, Weizhi Ma, Jian-Yun Nie

    Abstract: Graph-based models and contrastive learning have emerged as prominent methods in Collaborative Filtering (CF). While many existing models in CF incorporate these methods in their design, there seems to be a limited depth of analysis regarding the foundational principles behind them. This paper bridges graph convolution, a pivotal element of graph-based models, with contrastive learning through a t… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: KDD 2024

  37. arXiv:2406.13323  [pdf, other

    physics.optics

    An alkali-referenced vector spectrum analyzer for visible-light integrated photonics

    Authors: Baoqi Shi, Ming-Yang Zheng, Yunkai Zhao, Yi-Han Luo, Jinbao Long, Wei Sun, Wenbo Ma, Xiu-Ping Xie, Lan Gao, Chen Shen, Anting Wang, Wei Liang, Qiang Zhang, Junqiu Liu

    Abstract: Integrated photonics has reformed our information society by offering on-chip optical signal synthesis, processing and detection with reduced size, weight and power consumption. As such, it has been successfully established in the near-infrared (NIR) telecommunication bands. With the soaring demand in miniaturized systems for biosensing, quantum information and transportable atomic clocks, extensi… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  38. arXiv:2406.12646  [pdf, other

    eess.IV cs.AI cs.CV

    An Empirical Study on the Fairness of Foundation Models for Multi-Organ Image Segmentation

    Authors: Qin Li, Yizhe Zhang, Yan Li, Jun Lyu, Meng Liu, Longyu Sun, Mengting Sun, Qirong Li, Wenyue Mao, Xinran Wu, Yajing Zhang, Yinghua Chu, Shuo Wang, Chengyan Wang

    Abstract: The segmentation foundation model, e.g., Segment Anything Model (SAM), has attracted increasing interest in the medical image community. Early pioneering studies primarily concentrated on assessing and improving SAM's performance from the perspectives of overall accuracy and efficiency, yet little attention was given to the fairness considerations. This oversight raises questions about the potenti… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted to MICCAI-2024

  39. arXiv:2406.11775  [pdf, other

    cs.CV cs.AI

    Task Me Anything

    Authors: Jieyu Zhang, Weikai Huang, Zixian Ma, Oscar Michel, Dong He, Tanmay Gupta, Wei-Chiu Ma, Ali Farhadi, Aniruddha Kembhavi, Ranjay Krishna

    Abstract: Benchmarks for large multimodal language models (MLMs) now serve to simultaneously assess the general capabilities of models instead of evaluating for a specific capability. As a result, when a developer wants to identify which models to use for their application, they are overwhelmed by the number of benchmarks and remain uncertain about which benchmark's results are most reflective of their spec… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: website: https://www.task-me-anything.org

  40. arXiv:2406.10744  [pdf, other

    cs.CV

    Technique Report of CVPR 2024 PBDL Challenges

    Authors: Ying Fu, Yu Li, Shaodi You, Boxin Shi, Linwei Chen, Yunhao Zou, Zichun Wang, Yichen Li, Yuze Han, Yingkai Zhang, Jianan Wang, Qinglin Liu, Wei Yu, Xiaoqian Lv, Jianing Li, Shengping Zhang, Xiangyang Ji, Yuanpei Chen, Yuhan Zhang, Weihang Peng, Liwen Zhang, Zhe Xu, Dingyong Gou, Cong Li, Senyan Xu , et al. (75 additional authors not shown)

    Abstract: The intersection of physics-based vision and deep learning presents an exciting frontier for advancing computer vision technologies. By leveraging the principles of physics to inform and enhance deep learning models, we can develop more robust and accurate vision systems. Physics-based vision aims to invert the processes to recover scene properties such as shape, reflectance, light distribution, a… ▽ More

    Submitted 12 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 PBDL Challenges: https://pbdl-ws.github.io/pbdl2024/challenge/index.html

  41. arXiv:2406.10303  [pdf, other

    cs.CL cs.AI

    A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations

    Authors: Jinqiang Wang, Huansheng Ning, Yi Peng, Qikai Wei, Daniel Tesfai, Wenwei Mao, Tao Zhu, Runhe Huang

    Abstract: Large Language Models (LLMs) have demonstrated surprising performance across various natural language processing tasks. Recently, medical LLMs enhanced with domain-specific knowledge have exhibited excellent capabilities in medical consultation and diagnosis. These models can smoothly simulate doctor-patient dialogues and provide professional medical advice. Most medical LLMs are developed through… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 20 pages,3 figures

  42. arXiv:2406.10215  [pdf, other

    cs.CL cs.LG

    DevBench: A multimodal developmental benchmark for language learning

    Authors: Alvin Wei Ming Tan, Sunny Yu, Bria Long, Wanjing Anya Ma, Tonya Murray, Rebecca D. Silverman, Jason D. Yeatman, Michael C. Frank

    Abstract: How (dis)similar are the learning trajectories of vision-language models and children? Recent modeling work has attempted to understand the gap between models' and humans' data efficiency by constructing models trained on less data, especially multimodal naturalistic data. However, such models are often evaluated on adult-level benchmarks, with limited breadth in language abilities tested, and wit… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  43. arXiv:2406.09613  [pdf, other

    cs.CV

    ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

    Authors: Wufei Ma, Guanning Zeng, Guofeng Zhang, Qihao Liu, Letian Zhang, Adam Kortylewski, Yaoyao Liu, Alan Yuille

    Abstract: A vision model with general-purpose object-level 3D understanding should be capable of inferring both 2D (e.g., class name and bounding box) and 3D information (e.g., 3D location and 3D viewpoint) for arbitrary rigid objects in natural images. This is a challenging task, as it involves inferring 3D information from 2D signals and most importantly, generalizing to rigid objects from unseen categori… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  44. arXiv:2406.09003  [pdf, other

    cs.CV cs.LG

    Enhancing Cross-Modal Fine-Tuning with Gradually Intermediate Modality Generation

    Authors: Lincan Cai, Shuang Li, Wenxuan Ma, Jingxuan Kang, Binhui Xie, Zixun Sun, Chengwei Zhu

    Abstract: Large-scale pretrained models have proven immensely valuable in handling data-intensive modalities like text and image. However, fine-tuning these models for certain specialized modalities, such as protein sequence and cosmic ray, poses challenges due to the significant modality discrepancy and scarcity of labeled data. In this paper, we propose an end-to-end method, PaRe, to enhance cross-modal f… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  45. arXiv:2406.08980  [pdf, other

    q-bio.BM cs.LG

    From Theory to Therapy: Reframing SBDD Model Evaluation via Practical Metrics

    Authors: Bowen Gao, Haichuan Tan, Yanwen Huang, Minsi Ren, Xiao Huang, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Recent advancements in structure-based drug design (SBDD) have significantly enhanced the efficiency and precision of drug discovery by generating molecules tailored to bind specific protein pockets. Despite these technological strides, their practical application in real-world drug development remains challenging due to the complexities of synthesizing and testing these molecules. The reliability… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  46. arXiv:2406.08961  [pdf, other

    q-bio.BM cs.LG

    SIU: A Million-Scale Structural Small Molecule-Protein Interaction Dataset for Unbiased Bioactivity Prediction

    Authors: Yanwen Huang, Bowen Gao, Yinjun Jia, Hongbo Ma, Wei-Ying Ma, Ya-Qin Zhang, Yanyan Lan

    Abstract: Small molecules play a pivotal role in modern medicine, and scrutinizing their interactions with protein targets is essential for the discovery and development of novel, life-saving therapeutics. The term "bioactivity" encompasses various biological effects resulting from these interactions, including both binding and functional responses. The magnitude of bioactivity dictates the therapeutic or t… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  47. arXiv:2406.08953  [pdf, other

    cs.CV cs.LG

    Preserving Identity with Variational Score for General-purpose 3D Editing

    Authors: Duong H. Le, Tuan Pham, Aniruddha Kembhavi, Stephan Mandt, Wei-Chiu Ma, Jiasen Lu

    Abstract: We present Piva (Preserving Identity with Variational Score Distillation), a novel optimization-based method for editing images and 3D models based on diffusion models. Specifically, our approach is inspired by the recently proposed method for 2D image editing - Delta Denoising Score (DDS). We pinpoint the limitations in DDS for 2D and 3D editing, which causes detail loss and over-saturation. To a… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 22 pages, 14 figures

  48. arXiv:2406.08743  [pdf, other

    cs.LG

    Generalizable Implicit Neural Representation As a Universal Spatiotemporal Traffic Data Learner

    Authors: Tong Nie, Guoyang Qin, Wei Ma, Jian Sun

    Abstract: $\textbf{This is the conference version of our paper: Spatiotemporal Implicit Neural Representation as a Generalized Traffic Data Learner}… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted by the Conference in Emerging Technologies in Transportation Systems (TRC-30). arXiv admin note: substantial text overlap with arXiv:2405.03185

  49. arXiv:2406.08189  [pdf, other

    nlin.SI

    Inverse scattering transform for the defocusing-defocusing coupled Hirota equations with non-zero boundary conditions: double-pole solutions

    Authors: Peng-Fei Han, Wen-Xiu Ma, Ru-Suo Ye, Yi Zhang

    Abstract: The inverse scattering transform for the defocusing-defocusing coupled Hirota equations with non-zero boundary conditions at infinity is thoroughly discussed. We delve into the analytical properties of the Jost eigenfunctions and scrutinize the characteristics of the scattering coefficients. To enhance our investigation of the fundamental eigenfunctions, we have derived additional auxiliary eigenf… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  50. arXiv:2406.07791  [pdf, other

    cs.CL cs.AI

    Judging the Judges: A Systematic Investigation of Position Bias in Pairwise Comparative Assessments by LLMs

    Authors: Lin Shi, Chiyu Ma, Weicheng Ma, Soroush Vosoughi

    Abstract: LLM-as-a-Judge offers a promising alternative to human judges across various tasks, yet inherent biases, particularly position bias - a systematic preference for answers based on their position in the prompt - compromise its effectiveness. Our study investigates this issue by developing a framework to systematically study and quantify position bias using metrics such as repetitional consistency, p… ▽ More

    Submitted 12 August, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: 70 pages, around 200 figures and subfigures