Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 505 results for author: Deng, S

.
  1. arXiv:2408.04236  [pdf, other

    cs.LG cs.AI

    Cluster-Wide Task Slowdown Detection in Cloud System

    Authors: Feiyi Chen, Yingying Zhang, Lunting Fan, Yuxuan Liang, Guansong Pang, Qingsong Wen, Shuiguang Deng

    Abstract: Slow task detection is a critical problem in cloud operation and maintenance since it is highly related to user experience and can bring substantial liquidated damages. Most anomaly detection methods detect it from a single-task aspect. However, considering millions of concurrent tasks in large-scale cloud computing clusters, it becomes impractical and inefficient. Moreover, single-task slowdowns… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by KDD2024

  2. arXiv:2408.02017  [pdf, ps, other

    math.DS

    Existence of generalized solitary waves for a diatomic Fermi-Pasta-Ulam-Tsingou lattice

    Authors: Shengfu Deng, Shu-Ming Sun

    Abstract: This paper concerns the existence of generalized solitary waves (solitary waves with small ripples at infinity) for a diatomic Fermi-Pasta-Ulam-Tsingou (FPUT) lattice. It is proved that the FPUT lattice problem has a generalized solitary-wave solution with the amplitude of those ripples algebraically small using dynamical system approach. The problem is first formulated as a dynamical system probl… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    MSC Class: 37L60; 74J35; 34C37; 34D10

  3. arXiv:2407.19851  [pdf, other

    q-bio.PE cond-mat.stat-mech nlin.AO

    Evolution of cooperation in the public goods game with Q-learning

    Authors: Guozhong Zheng, Jiqiang Zhang, Shengfeng Deng, Weiran Cai, Li Chen

    Abstract: Recent paradigm shifts from imitation learning to reinforcement learning (RL) is shown to be productive in understanding human behaviors. In the RL paradigm, individuals search for optimal strategies through interaction with the environment to make decisions. This implies that gathering, processing, and utilizing information from their surroundings are crucial. However, existing studies typically… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: 16 pages, 12 figures, comments are appreciated

  4. arXiv:2407.19634  [pdf, other

    q-bio.PE cond-mat.stat-mech nlin.AO

    The evolution of cooperation with Q-learning: the impact of information perception

    Authors: Guozhong Zheng, Zhenwei Ding, Jiqiang Zhang, Shengfeng Deng, Weiran Cai, Li Chen

    Abstract: The inherent huge complexities in human beings show a remarkable diversity in response to complex surroundings, enabling us to tackle problems from different perspectives. In the realm of cooperation studies, however, existing work assumes that individuals get access to the same kind of information to make their decisions, in contrast to the facts that individuals often perceive differently. Here,… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 12pages, 13figures, comments are appreciated

  5. arXiv:2407.15133  [pdf

    physics.app-ph physics.optics

    Harmonizing Material Quantity and Terahertz Wave Interference Shielding Efficiency with Metallic Borophene Nanosheets

    Authors: Haojian Lin, Ximiao Wang, Zhaolong Cao, Hongjia Zhu, Jiahao Wu, Runze Zhan, Ningsheng Xu, Shaozhi Deng, Huanjun Chen, Fei Liu

    Abstract: Materials with electromagnetic interference (EMI) shielding in the terahertz (THz) regime, while minimizing the quantity used, are highly demanded for future information communication, healthcare and mineral resource exploration applications. Currently, there is often a trade-off between the amount of material used and the absolute EMI shielding effectiveness (EESt) for the EMI shielding materials… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  6. arXiv:2407.15081  [pdf, ps, other

    cond-mat.mtrl-sci cond-mat.mes-hall physics.comp-ph

    Deterministic and Efficient Switching of Sliding Ferroelectrics

    Authors: Shihan Deng, Hongyu Yu, Junyi Ji, Changsong Xu, Hongjun Xiang

    Abstract: Recent studies highlight the scientific importance and broad application prospects of two-dimensional (2D) sliding ferroelectrics, which prevalently exhibit vertical polarization with suitable stackings. It is crucial to understand the mechanisms of sliding ferroelectricity and to deterministically and efficiently switch the polarization with optimized electric fields. Here, applying our newly dev… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Main text: 16 pages, 4 figures. Supplementary: 9 pages, 6 figures

  7. arXiv:2407.15017  [pdf, other

    cs.CL cs.AI cs.CV cs.HC cs.LG

    Knowledge Mechanisms in Large Language Models: A Survey and Perspective

    Authors: Mengru Wang, Yunzhi Yao, Ziwen Xu, Shuofei Qiao, Shumin Deng, Peng Wang, Xiang Chen, Jia-Chen Gu, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

    Abstract: Understanding knowledge mechanisms in Large Language Models (LLMs) is crucial for advancing towards trustworthy AGI. This paper reviews knowledge mechanism analysis from a novel taxonomy including knowledge utilization and evolution. Knowledge utilization delves into the mechanism of memorization, comprehension and application, and creation. Knowledge evolution focuses on the dynamic progression o… ▽ More

    Submitted 31 July, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Ongoing work (v2); add Section 5: Application of Knowledge Mechanism; revise Section 6 and 7; fix typos

  8. arXiv:2407.08584  [pdf, other

    cs.DC

    Data-Locality-Aware Task Assignment and Scheduling for Distributed Job Executions

    Authors: Hailiang Zhao, Xueyan Tang, Peng Chen, Jianwei Yin, Shuiguang Deng

    Abstract: This paper investigates a data-locality-aware task assignment and scheduling problem aimed at minimizing job completion times for distributed job executions. Without prior knowledge of future job arrivals, we propose an optimal balanced task assignment algorithm (OBTA) that minimizes the completion time of each arriving job. We significantly reduce OBTA's computational overhead by narrowing the se… ▽ More

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  9. arXiv:2407.08583  [pdf, other

    cs.AI cs.CV cs.LG

    The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

    Authors: Zhen Qin, Daoyuan Chen, Wenhao Zhang, Liuyi Yao, Yilun Huang, Bolin Ding, Yaliang Li, Shuiguang Deng

    Abstract: The rapid development of large language models (LLMs) has been witnessed in recent years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from text to a broader spectrum of domains, attracting widespread attention due to the broader range of application scenarios. As LLMs and MLLMs rely on vast amounts of model parameters and data to achieve emergent capabilities, the impo… ▽ More

    Submitted 5 August, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Ongoing work. 21 pages. Related materials are continually maintained and available at https://github.com/modelscope/data-juicer/blob/main/docs/awesome_llm_data.md

  10. arXiv:2407.05784  [pdf, other

    cs.AR

    Hecaton: Training and Finetuning Large Language Models with Scalable Chiplet Systems

    Authors: Zongle Huang, Shupei Fan, Chen Tang, Xinyuan Lin, Shuwen Deng, Yongpan Liu

    Abstract: Large Language Models (LLMs) have achieved remarkable success in various fields, but their training and finetuning require massive computation and memory, necessitating parallelism which introduces heavy communication overheads. Driven by advances in packaging, the chiplet architecture emerges as a potential solution, as it can integrate computing power, as well as utilize on-package links with be… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

  11. arXiv:2407.04272  [pdf, other

    cs.LG cs.DC

    Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

    Authors: Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao

    Abstract: DLRM is a state-of-the-art recommendation system model that has gained widespread adoption across various industry applications. The large size of DLRM models, however, necessitates the use of multiple devices/GPUs for efficient training. A significant bottleneck in this process is the time-consuming all-to-all communication required to collect embedding data from all devices. To mitigate this, we… ▽ More

    Submitted 11 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: accepted by SC '24

  12. arXiv:2407.04192  [pdf, other

    cs.LG

    KAN-ODEs: Kolmogorov-Arnold Network Ordinary Differential Equations for Learning Dynamical Systems and Hidden Physics

    Authors: Benjamin C. Koenig, Suyong Kim, Sili Deng

    Abstract: Kolmogorov-Arnold networks (KANs) as an alternative to multi-layer perceptrons (MLPs) are a recent development demonstrating strong potential for data-driven modeling. This work applies KANs as the backbone of a neural ordinary differential equation (ODE) framework, generalizing their use to the time-dependent and temporal grid-sensitive cases often seen in dynamical systems and scientific machine… ▽ More

    Submitted 18 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: B.C.K. and S.K. contributed equally to this work. 20 pages, 10 figures, and 4 tables. Revised upload includes additional examples and extended discussion of existing examples

    ACM Class: I.6.5; G.1.7

  13. arXiv:2407.01914  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Switchable Ferroelectricity in Subnano Silicon Thin Films

    Authors: Hongyu Yu, Shihan deng, Muting Xie, Yuwen Zhang, Xizhi Shi, Jianxin Zhong, Chaoyu He, Hongjun Xiang

    Abstract: Recent advancements underscore the critical need to develop ferroelectric materials compatible with silicon. We systematically explore possible ferroelectric silicon quantum films and discover a low-energy variant (hex-OR-2*2-P) with energy just 1 meV/atom above the ground state (hex-OR-2*2). Both hex-OR-2*2 and hex-OR-2*2-P are confirmed to be dynamically and mechanically stable semiconductors wi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 18 pages, 3 figures

  14. arXiv:2407.01351  [pdf, other

    astro-ph.HE

    Probing the connection between IceCube neutrinos and MOJAVE AGN

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are prime candidate sources of the high-energy, astrophysical neutrinos detected by IceCube. This is demonstrated by the real-time multi-messenger detection of the blazar TXS 0506+056 and the recent evidence of neutrino emission from NGC 1068 from a separate time-averaged study. However, the production mechanism of the astrophysical neutrinos in AGN is not well establi… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 14 Pages 7 Figures

  15. arXiv:2407.01314  [pdf, other

    hep-ex

    Search for a light sterile neutrino with 7.5 years of IceCube DeepCore data

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: We present a search for an eV-scale sterile neutrino using 7.5 years of data from the IceCube DeepCore detector. The analysis uses a sample of 21,914 events with energies between 5 and 150 GeV to search for sterile neutrinos through atmospheric muon neutrino disappearance. Improvements in event selection and treatment of systematic uncertainties provide greater statistical power compared to previo… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 11 pages, 5 figures. To be submitted to Physical Review D

  16. arXiv:2407.00993  [pdf, other

    cs.AI cs.CL

    Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

    Authors: Shihan Deng, Weikai Xu, Hongda Sun, Wei Liu, Tao Tan, Jianfeng Liu, Ang Li, Jian Luan, Bin Wang, Rui Yan, Shuo Shang

    Abstract: With the remarkable advancements of large language models (LLMs), LLM-based agents have become a research hotspot in human-computer interaction. However, there is a scarcity of benchmarks available for LLM-based mobile agents. Benchmarking these agents generally faces three main challenges: (1) The inefficiency of UI-only operations imposes limitations to task evaluation. (2) Specific instructions… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  17. arXiv:2406.14535  [pdf, other

    stat.ME math.ST

    On estimation and order selection for multivariate extremes via clustering

    Authors: Shiyuan Deng, He Tang, Shuyang Bai

    Abstract: We investigate the estimation of multivariate extreme models with a discrete spectral measure using spherical clustering techniques. The primary contribution involves devising a method for selecting the order, that is, the number of clusters. The method consistently identifies the true order, i.e., the number of spectral atoms, and enjoys intuitive implementation in practice. Specifically, we intr… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 31 pages, 12 figures

    MSC Class: 62G32 (Primary); 60G70 (Secondary)

  18. arXiv:2406.11655  [pdf

    physics.app-ph physics.optics

    Monolithic Multi-parameter Terahertz Nano-micro Detector Based on Plasmon Polariton Atomic Cavity

    Authors: Huanjun Chen, Ximiao Wang, Shaojing Liu, Zhaolong Cao, Jinyang Li, Hongjia Zhu, Shangdong Li, Ningsheng Xu, Shaozhi Deng

    Abstract: Terahertz signals hold significant potential for ultra-wideband communication and high-resolution radar, necessitating miniaturized detectors capable of multi-parameter detection of intensity, frequency, polarization, and phase. Conventional detectors cannot meet these requirements. Here, we propose plasmon polariton atomic cavities (PPAC) made from single-atom-thick graphene, demonstrating the mo… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.11087  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    DP-MemArc: Differential Privacy Transfer Learning for Memory Efficient Language Models

    Authors: Yanming Liu, Xinyue Peng, Yuwei Zhang, Xiaolan Ke, Songhang Deng, Jiannan Cao, Chen Ma, Mengchen Fu, Xuhong Zhang, Sheng Cheng, Xun Wang, Jianwei Yin, Tianyu Du

    Abstract: Large language models have repeatedly shown outstanding performance across diverse applications. However, deploying these models can inadvertently risk user privacy. The significant memory demands during training pose a major challenge in terms of resource consumption. This substantial size places a heavy load on memory resources, raising considerable practical concerns. In this paper, we introduc… ▽ More

    Submitted 15 August, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: 9 pages second version

  20. arXiv:2406.08372  [pdf, other

    cs.CV

    APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation

    Authors: Weizhao He, Yang Zhang, Wei Zhuo, Linlin Shen, Jiaqi Yang, Songhe Deng, Liang Sun

    Abstract: Few-shot semantic segmentation (FSS) endeavors to segment unseen classes with only a few labeled samples. Current FSS methods are commonly built on the assumption that their training and application scenarios share similar domains, and their performances degrade significantly while applied to a distinct domain. To this end, we propose to leverage the cutting-edge foundation model, the Segment Anyt… ▽ More

    Submitted 12 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 15 pages, 9 figures

  21. arXiv:2406.07686  [pdf, other

    cs.CV

    AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation

    Authors: Kai Wang, Shijian Deng, Jing Shi, Dimitrios Hatzinakos, Yapeng Tian

    Abstract: Recent Diffusion Transformers (DiTs) have shown impressive capabilities in generating high-quality single-modality content, including images, videos, and audio. However, it is still under-explored whether the transformer-based diffuser can efficiently denoise the Gaussian noises towards superb multimodal content creation. To bridge this gap, we introduce AV-DiT, a novel and efficient audio-visual… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  22. arXiv:2406.07601  [pdf, other

    astro-ph.HE hep-ex

    IceCube Search for Neutrino Emission from X-ray Bright Seyfert Galaxies

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: The recent IceCube detection of TeV neutrino emission from the nearby active galaxy NGC 1068 suggests that active galactic nuclei (AGN) could make a sizable contribution to the diffuse flux of astrophysical neutrinos. The absence of TeV $γ$-rays from NGC 1068 indicates neutrino production in the vicinity of the supermassive black hole, where the high radiation density leads to $γ$-ray attenuation.… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 17 pages, 9 figures

  23. arXiv:2406.06684  [pdf, other

    astro-ph.HE

    Search for neutrino emission from hard X-ray AGN with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (401 additional authors not shown)

    Abstract: Active Galactic Nuclei (AGN) are promising candidate sources of high-energy astrophysical neutrinos since they provide environments rich in matter and photon targets where cosmic ray interactions may lead to the production of gamma rays and neutrinos. We searched for high-energy neutrino emission from AGN using the $\textit{Swift}$-BAT Spectroscopic Survey (BASS) catalog of hard X-ray sources and… ▽ More

    Submitted 12 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  24. arXiv:2406.06600  [pdf, other

    cs.LG cs.AI cs.CL

    HORAE: A Domain-Agnostic Modeling Language for Automating Multimodal Service Regulation

    Authors: Yutao Sun, Mingshuai Chen, Tiancheng Zhao, Kangjia Zhao, He Li, Jintao Chen, Liqiang Lu, Xinkui Zhao, Shuiguang Deng, Jianwei Yin

    Abstract: Artificial intelligence is rapidly encroaching on the field of service regulation. This work presents the design principles behind HORAE, a unified specification language to model multimodal regulation rules across a diverse set of domains. We show how HORAE facilitates an intelligent service regulation pipeline by further exploiting a fine-tuned large language model named HORAE that automates the… ▽ More

    Submitted 18 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  25. arXiv:2406.05639  [pdf, other

    cs.SE

    A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Automated Program Repair

    Authors: Guochang Li, Chen Zhi, Jialiang Chen, Junxiao Han, Shuiguang Deng

    Abstract: Automated Program Repair (APR) aims to fix bugs by generating patches. And existing work has demonstrated that "pre-training and fine-tuning" paradigm enables Large Language Models (LLMs) improve fixing capabilities on APR. However, existing work mainly focuses on Full-Model Fine-Tuning (FMFT) for APR and limited research has been conducted on the execution-based evaluation of Parameter-Efficient… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  26. arXiv:2406.05287  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Group-wise oracle-efficient algorithms for online multi-group learning

    Authors: Samuel Deng, Daniel Hsu, Jingwen Liu

    Abstract: We study the problem of online multi-group learning, a learning model in which an online learner must simultaneously achieve small prediction regret on a large collection of (possibly overlapping) subsequences corresponding to a family of groups. Groups are subsets of the context space, and in fairness applications, they may correspond to subpopulations defined by expressive functions of demograph… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  27. arXiv:2406.04657  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Crafting Heavy-Tails in Weight Matrix Spectrum without Gradient Noise

    Authors: Vignesh Kothapalli, Tianyu Pang, Shenyang Deng, Zongmin Liu, Yaoqing Yang

    Abstract: Modern training strategies of deep neural networks (NNs) tend to induce a heavy-tailed (HT) spectra of layer weights. Extensive efforts to study this phenomenon have found that NNs with HT weight spectra tend to generalize well. A prevailing notion for the occurrence of such HT spectra attributes gradient noise during training as a key contributing factor. Our work shows that gradient noise is unn… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 31 pages, 37 figures

  28. arXiv:2406.02554  [pdf, other

    eess.AS cs.AI cs.CL cs.CV cs.LG cs.MM

    Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition

    Authors: Shijian Deng, Erin E. Kosloski, Siddhi Patel, Zeke A. Barnett, Yiyang Nan, Alexander Kaplan, Sisira Aarukapalli, William T. Doan, Matthew Wang, Harsh Singh, Pamela R. Rollins, Yapeng Tian

    Abstract: In this article, we introduce a novel problem of audio-visual autism behavior recognition, which includes social behavior recognition, an essential aspect previously omitted in AI-assisted autism screening research. We define the task at hand as one that is audio-visual autism behavior recognition, which uses audio and visual cues, including any speech present in the audio, to recognize autism-rel… ▽ More

    Submitted 22 March, 2024; originally announced June 2024.

  29. arXiv:2406.02437  [pdf, other

    econ.GN

    Algorithmic Collusion in Dynamic Pricing with Deep Reinforcement Learning

    Authors: Shidi Deng, Maximilian Schiffer, Martin Bichler

    Abstract: Nowadays, a significant share of the Business-to-Consumer sector is based on online platforms like Amazon and Alibaba and uses Artificial Intelligence for pricing strategies. This has sparked debate on whether pricing algorithms may tacitly collude to set supra-competitive prices without being explicitly designed to do so. Our study addresses these concerns by examining the risk of collusion when… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  30. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  31. arXiv:2406.00905  [pdf, other

    hep-ex

    Exploration of mass splitting and muon/tau mixing parameters for an eV-scale sterile neutrino with IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (400 additional authors not shown)

    Abstract: We present the first three-parameter fit to a 3+1 sterile neutrino model using 7.634 years of data from the IceCube Neutrino Observatory on $ν_μ+\overlineν_μ$ charged-current interactions in the energy range 500-9976 GeV. Our analysis is sensitive to the mass-squared splitting between the heaviest and lightest mass state ($Δm_{41}^2$), the mixing matrix element connecting muon flavor to the fourth… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  32. arXiv:2405.17969  [pdf, other

    cs.CL cs.AI cs.CV cs.IR cs.LG

    Knowledge Circuits in Pretrained Transformers

    Authors: Yunzhi Yao, Ningyu Zhang, Zekun Xi, Mengru Wang, Ziwen Xu, Shumin Deng, Huajun Chen

    Abstract: The remarkable capabilities of modern large language models are rooted in their vast repositories of knowledge encoded within their parameters, enabling them to perceive the world and engage in reasoning. The inner workings of how these models store knowledge have long been a subject of intense interest and investigation among researchers. To date, most studies have concentrated on isolated compon… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Work in progress, 25 pages

  33. arXiv:2405.17883  [pdf, other

    physics.atom-ph

    Three-dimensional Magneto-optical Trapping of Barium Monofluoride

    Authors: Zixuan Zeng, Shuhua Deng, Shoukang Yang, Bo Yan

    Abstract: As a heavy molecule, barium monofluoride (BaF) presents itself as a promising candidate for measuring permanent electric dipole moment. The precision of such measurements can be significantly enhanced by utilizing a cold molecular sample. Here we report the realization of three-dimensional magneto-optical trapping (MOT) of BaF molecules. Through the repumping of all the vibrational states up to… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  34. arXiv:2405.15329  [pdf, other

    cs.CL

    Decompose and Aggregate: A Step-by-Step Interpretable Evaluation Framework

    Authors: Minzhi Li, Zhengyuan Liu, Shumin Deng, Shafiq Joty, Nancy F. Chen, Min-Yen Kan

    Abstract: The acceleration of Large Language Models (LLMs) research has opened up new possibilities for evaluating generated texts. They serve as scalable and economical evaluators, but the question of how reliable these evaluators are has emerged as a crucial research question. Prior research efforts in the meta-evaluation of LLMs as judges limit the prompting of an LLM to a single use to obtain a final ev… ▽ More

    Submitted 14 June, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

  35. arXiv:2405.14205  [pdf, other

    cs.CL cs.AI cs.CV cs.LG cs.MA

    Agent Planning with World Knowledge Model

    Authors: Shuofei Qiao, Runnan Fang, Ningyu Zhang, Yuqi Zhu, Xiang Chen, Shumin Deng, Yong Jiang, Pengjun Xie, Fei Huang, Huajun Chen

    Abstract: Recent endeavors towards directly using large language models (LLMs) as agent models to execute interactive planning tasks have shown commendable results. Despite their achievements, however, they still struggle with brainless trial-and-error in global planning and generating hallucinatory actions in local planning due to their poor understanding of the ''real'' physical world. Imitating humans' m… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Work in progress

  36. arXiv:2405.08077  [pdf, other

    hep-ex hep-ph

    Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube

    Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

    Abstract: We provide supporting details for the search for a 3+1 sterile neutrino using data collected over eleven years at the IceCube Neutrino Observatory. The analysis uses atmospheric muon-flavored neutrinos from 0.5 to 100\, TeV that traverse the Earth to reach the IceCube detector, and finds a best-fit point at $\sin^2(2θ_{24}) = 0.16$ and $Δm^{2}_{41} = 3.5$ eV$^2$ with a goodness-of-fit p-value of 1… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 18 pages, 17 figures, 2 tables. This long-form paper is a companion to the letter "A search for an eV-scale sterile neutrino using improved high-energy νμ event reconstruction in IceCube."

  37. arXiv:2405.08070  [pdf, other

    hep-ex hep-ph

    A search for an eV-scale sterile neutrino using improved high-energy $ν_μ$ event reconstruction in IceCube

    Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise , et al. (398 additional authors not shown)

    Abstract: This Letter presents the result of a 3+1 sterile neutrino search using 10.7 years of IceCube data. We analyze atmospheric muon neutrinos that traverse the Earth with energies ranging from 0.5 to 100 TeV, incorporating significant improvements in modeling neutrino flux and detector response compared to earlier studies. Notably, for the first time, we categorize data into starting and through-going… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 9 pages, 3 figures. This letter is supported by the long-form paper "Methods and stability tests associated with the sterile neutrino search using improved high-energy $ν_μ$ event reconstruction in IceCube," also appearing on arXiv

  38. arXiv:2405.03817  [pdf, other

    astro-ph.HE

    Search for joint multimessenger signals from potential Galactic PeVatrons with HAWC and IceCube

    Authors: R. Alfaro, C. Alvarez, J. C. Arteaga-Velázquez, D. Avila Rojas, H. A. Ayala Solares, R. Babu, E. Belmont-Moreno, K. S. Caballero-Mora, T. Capistrán, A. Carramiñana, S. Casanova, U. Cotti, J. Cotzomi, S. Coutiño de León, E. De la Fuente, D. Depaoli, N. Di Lalla, R. Diaz Hernandez, J. C. Díaz-Vélez, K. Engel, T. Ergin, K. L. Fan, K. Fang, N. Fraija, S. Fraija , et al. (469 additional authors not shown)

    Abstract: Galactic PeVatrons are sources that can accelerate cosmic rays to PeV energies. The high-energy cosmic rays are expected to interact with the surrounding ambient material or radiation, resulting in the production of gamma rays and neutrinos. To optimize for the detection of such associated production of gamma rays and neutrinos for a given source morphology and spectrum, a multi-messenger analysis… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  39. arXiv:2405.03558  [pdf, other

    cond-mat.mtrl-sci

    Progress in Computational Understanding of Ferroelectric Mechanisms in HfO$_2$

    Authors: Tianyuan Zhu, Liyang Ma, Shiqing Deng, Shi Liu

    Abstract: Since the first report of ferroelectricity in nanoscale HfO$_2$-based thin films in 2011, this silicon-compatible binary oxide has quickly garnered intense interest in academia and industry, and continues to do so. Despite its deceivingly simple chemical composition, the ferroelectric physics supported by HfO$_2$ is remarkably complex, arguably rivaling that of perovskite ferroelectrics. Computati… ▽ More

    Submitted 11 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  40. arXiv:2405.01203  [pdf, other

    cond-mat.mtrl-sci

    Optimization of reactively sputtered Mn3GaN films based on resistivity measurements

    Authors: Christoph Sürgers, Gerda Fischer, Sihao Deng, Dongmei Hu, Cong Wang

    Abstract: Mn-based nitrides with antiperovskite structures have several properties that can be utilised for antiferromagnetic spintronics. Their magnetic properties depend on the structural quality, composition and doping of the cubic antiperovskite structure. Such nitride thin films are usually produced by reactive physical vapour deposition, where the deposition rate of N can only be controlled by the N2… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: 7 pages, 5 figures, 43 references

  41. arXiv:2404.19589  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Acceptance Tests of more than 10 000 Photomultiplier Tubes for the multi-PMT Digital Optical Modules of the IceCube Upgrade

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi , et al. (399 additional authors not shown)

    Abstract: More than 10,000 photomultiplier tubes (PMTs) with a diameter of 80 mm will be installed in multi-PMT Digital Optical Modules (mDOMs) of the IceCube Upgrade. These have been tested and pre-calibrated at two sites. A throughput of more than 1000 PMTs per week with both sites was achieved with a modular design of the testing facilities and highly automated testing procedures. The testing facilities… ▽ More

    Submitted 20 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: 24 pages, 19 figures, 2 tables, submitted to JINST

  42. arXiv:2404.17876  [pdf, other

    cs.CV

    DF-SLAM: Dictionary Factors Representation for High-Fidelity Neural Implicit Dense Visual SLAM System

    Authors: Weifeng Wei, Jie Wang, Shuqi Deng, Jie Liu

    Abstract: We introduce a high-fidelity neural implicit dense visual Simultaneous Localization and Mapping (SLAM) system, termed DF-SLAM. In our work, we employ dictionary factors for scene representation, encoding the geometry and appearance information of the scene as a combination of basis and coefficient factors. Compared to neural implicit dense visual SLAM methods that directly encode scene information… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

  43. arXiv:2404.15237  [pdf

    cond-mat.mtrl-sci

    Insights into the defect-driven heterogeneous structural evolution of Ni-rich layered cathode in lithium-ion batteries

    Authors: Zhongyuan Huang, Ziwei Chen, Maolin Yang, Mihai Chu, Zenan Li, Sihao Deng, Lunhua He, Lei Jin, Rafal E. Dunin-Borkowski, Rui Wang, Jun Wang, Tingting Yang, Yinguo Xiao

    Abstract: Recently, considerable efforts have been made on research and improvement for Ni-rich lithium-ion batteries to meet the demand from vehicles and grid-level large-scale energy storage. Development of next-generation high-performance lithium-ion batteries requires a comprehensive understanding on the underlying electrochemical mechanisms associated with its structural evolution. In this work, advanc… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 29 pages and 5 figures for manuscript; 30 pages, 14 figures and 4 tables for supplementary information

  44. arXiv:2404.14755  [pdf, other

    cs.MM cs.AI cs.CV cs.HC

    SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models

    Authors: Bo Lin, Yingjing Xu, Xuanwen Bao, Zhou Zhao, Zuyong Zhang, Zhouyang Wang, Jie Zhang, Shuiguang Deng, Jianwei Yin

    Abstract: With the continuous advancement of vision language models (VLMs) technology, remarkable research achievements have emerged in the dermatology field, the fourth most prevalent human disease category. However, despite these advancements, VLM still faces "hallucination" in dermatological diagnosis, and due to the inherent complexity of dermatological conditions, existing tools offer relatively limite… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  45. arXiv:2404.06443  [pdf, other

    cs.CV

    Multi-scale Dynamic and Hierarchical Relationship Modeling for Facial Action Units Recognition

    Authors: Zihan Wang, Siyang Song, Cheng Luo, Songhe Deng, Weicheng Xie, Linlin Shen

    Abstract: Human facial action units (AUs) are mutually related in a hierarchical manner, as not only they are associated with each other in both spatial and temporal domains but also AUs located in the same/close facial regions show stronger relationships than those of different facial regions. While none of existing approach thoroughly model such hierarchical inter-dependencies among AUs, this paper propos… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  46. arXiv:2404.01687  [pdf, other

    hep-ex

    Search for a sub-eV sterile neutrino using Daya Bay's full dataset

    Authors: F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, Y. Y. Ding , et al. (176 additional authors not shown)

    Abstract: This Letter presents results of a search for the mixing of a sub-eV sterile neutrino with three active neutrinos based on the full data sample of the Daya Bay Reactor Neutrino Experiment, collected during 3158 days of detector operation, which contains $5.55 \times 10^{6}$ reactor \anue candidates identified as inverse beta-decay interactions followed by neutron-capture on gadolinium. The analysis… ▽ More

    Submitted 20 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 7 pages, 4 figures, 1 table

  47. arXiv:2404.01600  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    C-type antiferromagnetic structure of topological semimetal CaMnSb$_2$

    Authors: Bo Li, Xu-Tao Zeng, Qianhui Xu, Fan Yang, Junsen Xiang, Hengyang Zhong, Sihao Deng, Lunhua He, Juping Xu, Wen Yin, Xingye Lu, Huiying Liu, Xian-Lei Sheng, Wentao Jin

    Abstract: Determination of the magnetic structure and confirmation of the presence or absence of inversion ($\mathcal{P}$) and time reversal ($\mathcal{T}$) symmetry is imperative for correctly understanding the topological magnetic materials. Here high-quality single crystals of the layered manganese pnictide CaMnSb$_2$ are synthesized using the self-flux method. De Haas-van Alphen oscillations indicate a… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 7 Pages, 6 figures

    Journal ref: Chinese Physics Letters 41, 037104 (2024)

  48. arXiv:2403.19964  [pdf, other

    cs.CV cs.CY cs.LG

    FairRAG: Fair Human Generation via Fair Retrieval Augmentation

    Authors: Robik Shrestha, Yang Zou, Qiuyu Chen, Zhiheng Li, Yusheng Xie, Siqi Deng

    Abstract: Existing text-to-image generative models reflect or even amplify societal biases ingrained in their training data. This is especially concerning for human image generation where models are biased against certain demographic groups. Existing attempts to rectify this issue are hindered by the inherent limitations of the pre-trained models and fail to substantially improve demographic diversity. In t… ▽ More

    Submitted 5 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: CVPR 2024

  49. arXiv:2403.19460  [pdf, other

    cs.RO cs.AI

    RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation

    Authors: Chongkai Gao, Zhengrong Xue, Shuying Deng, Tianhai Liang, Siqi Yang, Lin Shao, Huazhe Xu

    Abstract: We present RiEMann, an end-to-end near Real-time SE(3)-Equivariant Robot Manipulation imitation learning framework from scene point cloud input. Compared to previous methods that rely on descriptor field matching, RiEMann directly predicts the target poses of objects for manipulation without any object segmentation. RiEMann learns a manipulation task from scratch with 5 to 10 demonstrations, gener… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  50. arXiv:2403.14472  [pdf, other

    cs.CL cs.AI cs.CV cs.HC cs.LG

    Detoxifying Large Language Models via Knowledge Editing

    Authors: Mengru Wang, Ningyu Zhang, Ziwen Xu, Zekun Xi, Shumin Deng, Yunzhi Yao, Qishen Zhang, Linyi Yang, Jindong Wang, Huajun Chen

    Abstract: This paper investigates using knowledge editing techniques to detoxify Large Language Models (LLMs). We construct a benchmark, SafeEdit, which covers nine unsafe categories with various powerful attack prompts and equips comprehensive metrics for systematic evaluation. We conduct experiments with several knowledge editing approaches, indicating that knowledge editing has the potential to detoxify… ▽ More

    Submitted 28 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: ACL 2024. Project website: https://zjunlp.github.io/project/SafeEdit Benchmark: https://huggingface.co/datasets/zjunlp/SafeEdit