Showing 1–50 of 103,723 results for author: Zhang

  1. arXiv:2408.08310  [pdf, other]

    cs.CL

    ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws

    Authors: Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng

    Abstract: High-quality data is crucial for the pre-training performance of large language models. Unfortunately, existing quality filtering methods rely on a known high-quality dataset as reference, which can introduce potential bias and compromise diversity. In this paper, we propose ScalingFilter, a novel approach that evaluates text quality based on the perplexity difference between two language models t…

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2408.08302  [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Benchmarking the Capabilities of Large Language Models in Transportation System Engineering: Accuracy, Consistency, and Reasoning Behaviors

    Authors: Usman Syed, Ethan Light, Xingang Guo, Huan Zhang, Lianhui Qin, Yanfeng Ouyang, Bin Hu

    Abstract: In this paper, we explore the capabilities of state-of-the-art large language models (LLMs) such as GPT-4, GPT-4o, Claude 3.5 Sonnet, Claude 3 Opus, Gemini 1.5 Pro, Llama 3, and Llama 3.1 in solving some selected undergraduate-level transportation engineering problems. We introduce TransportBench, a benchmark dataset that includes a sample of transportation engineering problems on a wide range of…

    Submitted 15 August, 2024; originally announced August 2024.

  3. arXiv:2408.08299  [pdf, other]

    astro-ph.GA astro-ph.SR

    Dynamical Accretion Flows -- ALMAGAL: Flows along filamentary structures in high-mass star-forming clusters

    Authors: M. R. A. Wells, H. Beuther, S. Molinari, P. Schilke, C. Battersby, P. Ho, Á. Sánchez-Monge, B. Jones, M. B. Scheuck, J. Syed, C. Gieser, R. Kuiper, D. Elia, A. Coletta, A. Traficante, J. Wallace, A. J. Rigby, R. S. Klessen, Q. Zhang, S. Walch, M. T. Beltrán, Y. Tang, G. A. Fuller, D. C. Lis, T. Möller, et al. (25 additional authors not shown)

    Abstract: We use data from the ALMA Evolutionary Study of High Mass Protocluster Formation in the Galaxy (ALMAGAL) survey to study 100 ALMAGAL regions at $\sim$ 1\arcsec~ resolution located between $\sim$ 2 and 6~kpc distance. Using ALMAGAL $\sim$ 1.3mm line and continuum data we estimate flow rates onto individual cores. We focus specifically on flow rates along filamentary structures associated with these…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 11 pages, 11 figures, accepted for publication in A&A

  4. arXiv:2408.08295  [pdf, other]

    cs.CV cs.AI cs.LG

    SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training

    Authors: Gengwei Zhang, Liyuan Wang, Guoliang Kang, Ling Chen, Yunchao Wei

    Abstract: In recent years, continual learning with pre-training (CLPT) has received widespread interest, instead of its traditional focus of training from scratch. The use of strong pre-trained models (PTMs) can greatly facilitate knowledge transfer and alleviate catastrophic forgetting, but also suffers from progressive overfitting of pre-trained knowledge into specific downstream tasks. A majority of curr…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: This paper is an extension of our ICCV 23 paper (arXiv:2303.05118)

  5. arXiv:2408.08290  [pdf, other]

    cond-mat.mtrl-sci

    Tunable polar distortions and magnetism in Gd$_x$La$_{1-x}$PtSb epitaxial films

    Authors: Dongxue Du, Cheyu Zhang, Jingrui Wei, Yujia Teng, Konrad Genser, Paul M. Voyles, Karin M. Rabe, Jason K. Kawasaki

    Abstract: Hexagonal $ABC$ intermetallics are predicted to have tunable ferroelectric, topological, and magnetic properties as a function of the polar buckling of $BC$ atomic planes. We report the impact of isovalent lanthanide substitution on the buckling, structural phase transitions, and electronic and magnetic properties of Gd$_x$La$_{1-x}$PtSb films grown by molecular beam epitaxy (MBE) on c-plane sapph…

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2408.08274  [pdf, other]

    cs.LG

    BAM! Just Like That: Simple and Efficient Parameter Upcycling for Mixture of Experts

    Authors: Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Ustun, Acyr Locatelli

    Abstract: The Mixture of Experts (MoE) framework has become a popular architecture for large language models due to its superior performance over dense models. However, training MoEs from scratch in a large-scale regime is prohibitively expensive. Existing methods mitigate this by pre-training multiple dense expert models independently and using them to initialize an MoE. This is done by using experts' feed…

    Submitted 15 August, 2024; originally announced August 2024.

  7. arXiv:2408.08266  [pdf, ps, other]

    math.AG

    IVHS via Kuznetsov components and categorical Torelli theorems for weighted hypersurfaces

    Authors: Xun Lin, Jørgen Vold Rennemo, Shizhuo Zhang

    Abstract: We study the categorical Torelli theorem for smooth (weighted) hypersurfaces in (weighted) projective spaces via the Hochschild--Serre algebra of its Kuznetsov component. In the first part of the paper, we show that a natural graded subalgebra of the Hochschild--Serre algebra of the Kuznetsov component of a degree $d$ weighted hypersurface in $\mathbb{P}(a_0,\ldots,a_n)$ reconstructs the graded su…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 28 pages, comments are very welcome

    MSC Class: Primary 14F05; secondary 14J45; 14D20; 14D23

  8. arXiv:2408.08242  [pdf, ps, other]

    cs.RO cs.AI cs.LG eess.SY

    A Conflicts-free, Speed-lossless KAN-based Reinforcement Learning Decision System for Interactive Driving in Roundabouts

    Authors: Zhihao Lin, Zhen Tian, Qi Zhang, Ziyang Ye, Hanyang Zhuang, Jianglin Lan

    Abstract: Safety and efficiency are crucial for autonomous driving in roundabouts, especially in the context of mixed traffic where autonomous vehicles (AVs) and human-driven vehicles coexist. This paper introduces a learning-based algorithm tailored to foster safe and efficient driving behaviors across varying levels of traffic flows in roundabouts. The proposed algorithm employs a deep Q-learning network…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 15 pages, 12 figures, submitted to an IEEE journal

  9. arXiv:2408.08232  [pdf, ps, other]

    math.OC

    Equivalent Characterizations of the Aubin Property for Nonlinear Semidefinite Programming

    Authors: Liang Chen, Ruoning Chen, Defeng Sun, Liping Zhang

    Abstract: In this paper, we study the Aubin property of the Karush-Kuhn-Tucker solution mapping for the nonlinear semidefinite programming (NLSDP) problem at a locally optimal solution. In the literature, it is known that the Aubin property implies the constraint nondegeneracy by Fusek [SIAM J. Optim. 23 (2013), pp. 1041-1061] and the second-order sufficient condition by Ding et al. [SIAM J. Optim. 27 (2017…

    Submitted 15 August, 2024; originally announced August 2024.

    MSC Class: 49J53; 90C22; 90C31; 90C46

  10. arXiv:2408.08231  [pdf, other]

    cs.IR

    DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System

    Authors: Xihong Yang, Heming Jing, Zixing Zhang, Jindong Wang, Huakang Niu, Shuaiqiang Wang, Yu Lu, Junfeng Wang, Dawei Yin, Xinwang Liu, En Zhu, Defu Lian, Erxue Min

    Abstract: Benefiting from the strong reasoning capabilities, Large language models (LLMs) have demonstrated remarkable performance in recommender systems. Various efforts have been made to distill knowledge from LLMs to enhance collaborative models, employing techniques like contrastive learning for representation alignment. In this work, we prove that directly aligning the representations of LLMs and colla…

    Submitted 15 August, 2024; originally announced August 2024.

  11. arXiv:2408.08222  [pdf, other]

    cs.LG

    Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

    Authors: Xuehao Wang, Weisen Jiang, Shuai Fu, Yu Zhang

    Abstract: Sharpness-aware minimization (SAM) is to improve model generalization by searching for flat minima in the loss landscape. The SAM update consists of one step for computing the perturbation and the other for computing the update gradient. Within the two steps, the choice of the perturbation radius is crucial to the performance of SAM, but finding an appropriate perturbation radius is challenging. I…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted by ECML PKDD 2024

  12. arXiv:2408.08209  [pdf, other]

    cs.IR

    Modeling Domain and Feedback Transitions for Cross-Domain Sequential Recommendation

    Authors: Changshuo Zhang, Teng Shi, Xiao Zhang, Qi Liu, Ruobing Xie, Jun Xu, Ji-Rong Wen

    Abstract: Nowadays, many recommender systems encompass various domains to cater to users' diverse needs, leading to user behaviors transitioning across different domains. In fact, user behaviors across different domains reveal changes in preference toward recommended items. For instance, a shift from negative feedback to positive feedback indicates improved user satisfaction. However, existing cross-domain…

    Submitted 15 August, 2024; originally announced August 2024.

  13. arXiv:2408.08196  [pdf, other]

    quant-ph

    Revealing inadvertent periodic modulation of qubit frequency

    Authors: Filip Wudarski, Yaxing Zhang, Juan Atalaya, M. I. Dykman

    Abstract: The paper describes the means to reveal and characterize slow periodic modulation of qubit frequency. Such modulation can come from different sources and can impact qubit stability. We show that the modulation leads to very sharp peaks in the power spectrum of outcomes of periodically repeated Ramsey measurements. The positions and shapes of the peaks allow finding both the frequency and the ampli…

    Submitted 15 August, 2024; originally announced August 2024.

  14. arXiv:2408.08194  [pdf, other]

    cond-mat.quant-gas cond-mat.mes-hall cond-mat.str-el quant-ph

    Doublons Bloch oscillations in the mass-imbalanced extended Fermi-Hubbard model

    Authors: Kun-Liang Zhang

    Abstract: Interactions between particles normally induce the decay of the particles Bloch oscillations (BOs) in a periodic lattice. In the large on-site interactions region, spin-$1/2$ fermions may form into doublon bound state and undergoes doublon BOs in the present of tilted potential. Here we investigate the impact of nearest-neighbor interaction $V$ on the multi-doublon BOs in a mass-imbalanced extende…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures

  15. "I Try to Represent Myself as I Am": Self-Presentation Preferences of People with Invisible Disabilities through Embodied Social VR Avatars

    Authors: Ria J. Gualano, Lucy Jiang, Kexin Zhang, Tanisha Shende, Andrea Stevenson Won, Shiri Azenkot

    Abstract: With the increasing adoption of social virtual reality (VR), it is critical to design inclusive avatars. While researchers have investigated how and why blind and d/Deaf people wish to disclose their disabilities in VR, little is known about the preferences of many others with invisible disabilities (e.g., ADHD, dyslexia, chronic conditions). We filled this gap by interviewing 15 participants, eac…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: To appear at ASSETS 2024

  16. arXiv:2408.08192  [pdf, other]

    cs.LG cs.GT cs.MA math.OC

    Stochastic Semi-Gradient Descent for Learning Mean Field Games with Population-Aware Function Approximation

    Authors: Chenyu Zhang, Xu Chen, Xuan Di

    Abstract: Mean field games (MFGs) model the interactions within a large-population multi-agent system using the population distribution. Traditional learning methods for MFGs are based on fixed-point iteration (FPI), which calculates best responses and induced population distribution separately and sequentially. However, FPI-type methods suffer from inefficiency and instability, due to oscillations caused b…

    Submitted 15 August, 2024; originally announced August 2024.

  17. arXiv:2408.08162  [pdf, ps, other]

    physics.optics math-ph

    Perturbation theory for resonant states near a bound state in the continuum

    Authors: Nan Zhang, Ya Yan Lu

    Abstract: In this work, we develop a perturbation theory to analyze resonant states near a bound state in the continuum (BIC) in photonic crystal slabs. The theory allows us to rigorously determine the asymptotic behavior of $Q$-factor and the far-field polarization. We show that the resonant states near a BIC can be nearly circularly polarized if the scattering matrix is subject to a certain condition. Mor…

    Submitted 15 August, 2024; originally announced August 2024.

  18. arXiv:2408.08152  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

    Authors: Huajian Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z. F. Wu, Fuli Luo, Chong Ruan

    Abstract: We introduce DeepSeek-Prover-V1.5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes. Pre-trained on DeepSeekMath-Base with specialization in formal mathematical languages, the model undergoes supervised fine-tuning using an enhanced formal theorem proving dataset derived from DeepSeek-Prover-…

    Submitted 15 August, 2024; originally announced August 2024.

  19. arXiv:2408.08147  [pdf, other]

    cs.DC cs.CL cs.LG

    P/D-Serve: Serving Disaggregated Large Language Model at Scale

    Authors: Yibo Jin, Tao Wang, Huimin Lin, Mingyang Song, Peiyang Li, Yipeng Ma, Yicheng Shan, Zhengfan Yuan, Cailong Li, Yajing Sun, Tiandeng Wu, Xing Chu, Ruizhi Huan, Li Ma, Xiao You, Wenting Zhou, Yunpeng Ye, Wen Liu, Xiangkun Xu, Yongsheng Zhang, Tiantian Dong, Jiawei Zhu, Zhe Wang, Xijian Ju, Jianxun Song, et al. (5 additional authors not shown)

    Abstract: Serving disaggregated large language models (LLMs) over tens of thousands of xPU devices (GPUs or NPUs) with reliable performance faces multiple challenges. 1) Ignoring the diversity (various prefixes and tidal requests), treating all the prompts in a mixed pool is inadequate. To facilitate the similarity per scenario and minimize the inner mismatch on P/D (prefill and decoding) processing, fine-g…

    Submitted 15 August, 2024; originally announced August 2024.

  20. arXiv:2408.08146  [pdf, other]

    cs.CL

    KOALA: Enhancing Speculative Decoding for LLM via Multi-Layer Draft Heads with Adversarial Learning

    Authors: Kaiqi Zhang, Jing Zhao, Rui Chen

    Abstract: Large Language Models (LLMs) exhibit high inference latency due to their autoregressive decoding nature. While the draft head in speculative decoding mitigates this issue, its full potential remains unexplored. In this paper, we introduce KOALA (K-layer Optimized Adversarial Learning Architecture), an orthogonal approach to the draft head. By transforming the conventional single-layer draft head i…

    Submitted 15 August, 2024; originally announced August 2024.

  21. arXiv:2408.08120  [pdf, other]

    cond-mat.mes-hall physics.app-ph

    Study of non-diffusive thermal behaviors in nanoscale transistors under different heating strategies

    Authors: Chuang Zhang, Ziyang Xin, Qin Lou, Hong Liang

    Abstract: Understanding the phonon transport mechanisms and efficiently capturing the spatiotemporal distributions of temperature is of great significance for alleviating hotspot issues in the electronic devices. Most previous simulations mainly focused on the steady-state problem with continuous heating, and the effective Fourier's law (EFL) is widely used for practical multiscale thermal engineering due t…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 25 pages, 52 figures

    MSC Class: 82D37; 80A05 80A19

  22. arXiv:2408.08117  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Two-dimensional superconductivity in a thick exfoliated kagome film

    Authors: Fei Sun, Andrea Capa Salinas, Stephen D. Wilson, Haijing Zhang

    Abstract: We report the observation of two-dimensional superconductivity (2D SC) in exfoliated kagome metal CsV$_3$Sb$_5$ with a thickness far thicker than the atomic limit. By examining the critical current and upper critical magnetic fields ($H_{c2}$) of 40-60 nm thick films in the superconducting state, we identify a pronounced Berezinskii-Kosterlitz-Thouless (BKT) transition behavior, i.e. a drastic dec…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 6 pages and 4 figures for the main text; 10 pages and 9 figures for the Supplementary Materials

  23. arXiv:2408.08108  [pdf, other]

    cs.CV

    Unsupervised Part Discovery via Dual Representation Alignment

    Authors: Jiahao Xia, Wenjian Huang, Min Xu, Jianguo Zhang, Haimin Zhang, Ziyu Sheng, Dong Xu

    Abstract: Object parts serve as crucial intermediate representations in various downstream tasks, but part-level representation learning still has not received as much attention as other vision tasks. Previous research has established that Vision Transformer can learn instance-level attention without labels, extracting high-quality instance-level representations for boosting downstream tasks. In this paper,…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted by TPAMI-2024

  24. arXiv:2408.08105  [pdf, other]

    cs.CV cs.AI

    Multimodal Causal Reasoning Benchmark: Challenging Vision Large Language Models to Infer Causal Links Between Siamese Images

    Authors: Zhiyuan Li, Heng Wang, Dongnan Liu, Chaoyi Zhang, Ao Ma, Jieting Long, Weidong Cai

    Abstract: Large Language Models (LLMs) have showcased exceptional ability in causal reasoning from textual information. However, will these causalities remain straightforward for Vision Large Language Models (VLLMs) when only visual hints are provided? Motivated by this, we propose a novel Multimodal Causal Reasoning benchmark, namely MuCR, to challenge VLLMs to infer semantic cause-and-effect relationship…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 20 pages

  25. arXiv:2408.08101  [pdf]

    eess.SY

    Stochastic Real-Time Economic Dispatch for Integrated Electric and Gas Systems Considering Uncertainty Propagation and Pipeline Leakage

    Authors: eiyao Zhao, Zhengshuo Li, Jiahui Zhang, Xiang Bai, Jia Su

    Abstract: Gas-fired units (GFUs) with rapid regulation capabilities are considered an effective tool to mitigate fluctuations in the generation of renewable energy sources and have coupled electricity power systems (EPSs) and natural gas systems (NGSs) more tightly. However, this tight coupling leads to uncertainty propagation, a challenge for the real-time dispatch of such integrated electric and gas syste…

    Submitted 15 August, 2024; originally announced August 2024.

  26. arXiv:2408.08093  [pdf, other]

    cs.CV cs.MM

    When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding

    Authors: Pingping Zhang, Jinlong Li, Meng Wang, Nicu Sebe, Sam Kwong, Shiqi Wang

    Abstract: Existing codecs are designed to eliminate intrinsic redundancies to create a compact representation for compression. However, strong external priors from Multimodal Large Language Models (MLLMs) have not been explicitly explored in video compression. Herein, we introduce a unified paradigm for Cross-Modality Video Coding (CMVC), which is a pioneering approach to explore multimodality representatio…

    Submitted 15 August, 2024; originally announced August 2024.

  27. arXiv:2408.08072  [pdf, other]

    cs.CL

    I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm

    Authors: Yiming Liang, Ge Zhang, Xingwei Qu, Tianyu Zheng, Jiawei Guo, Xinrun Du, Zhenzhu Yang, Jiaheng Liu, Chenghua Lin, Lei Ma, Wenhao Huang, Jiajun Zhang

    Abstract: Large Language Models (LLMs) have achieved significant advancements, however, the common learning paradigm treats LLMs as passive information repositories, neglecting their potential for active learning and alignment. Some approaches train LLMs using their own generated synthetic data, exploring the possibility of active alignment. However, there is still a huge gap between these one-time alignmen…

    Submitted 15 August, 2024; originally announced August 2024.

  28. arXiv:2408.08067  [pdf, other]

    cs.CL cs.AI

    RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation

    Authors: Dongyu Ru, Lin Qiu, Xiangkun Hu, Tianhang Zhang, Peng Shi, Shuaichen Chang, Jiayang Cheng, Cunxiang Wang, Shichao Sun, Huanyu Li, Zizhao Zhang, Binjie Wang, Jiarong Jiang, Tong He, Zhiguo Wang, Pengfei Liu, Yue Zhang, Zheng Zhang

    Abstract: Despite Retrieval-Augmented Generation (RAG) has shown promising capability in leveraging external knowledge, a comprehensive evaluation of RAG systems is still challenging due to the modular nature of RAG, evaluation of long-form responses and reliability of measurements. In this paper, we propose a fine-grained evaluation framework, RAGChecker, that incorporates a suite of diagnostic metrics for…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Under Review

  29. arXiv:2408.08066  [pdf, other]

    cs.IR

    Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval

    Authors: Hanqi Zhang, Chong Chen, Lang Mei, Qi Liu, Jiaxin Mao

    Abstract: In the information retrieval (IR) area, dense retrieval (DR) models use deep learning techniques to encode queries and passages into embedding space to compute their semantic relations. It is important for DR models to balance both efficiency and effectiveness. Pre-trained language models (PLMs), especially Transformer-based PLMs, have been proven to be effective encoders of DR models. However, th…

    Submitted 15 August, 2024; originally announced August 2024.

  30. arXiv:2408.08063  [pdf, other]

    astro-ph.CO

    Constraining Ultralight ALP Dark Matter in Light of Cosmic Birefringence

    Authors: Dongdong Zhang, Elisa G. M. Ferreira, Ippei Obata, Toshiya Namikawa

    Abstract: Cosmic birefringence, the observed rotation of the polarization plane of the cosmic microwave background (CMB), serves as a compelling probe for parity-violating physics beyond the Standard Model. This study explores the potential of ultralight axion-like particle (ALP) dark matter to explain the observed cosmic birefringence in the CMB. We focus on the previously understudied mass range of…

    Submitted 15 August, 2024; originally announced August 2024.

  31. arXiv:2408.08057  [pdf, other]

    eess.SP

    Optimal Joint Fronthaul Compression and Beamforming Design for Networked ISAC Systems

    Authors: Kexin Zhang, Yanqing Xu, Ruisi He, Chao Shen, Tsung-hui Chang

    Abstract: This study investigates a networked integrated sensing and communication (ISAC) system, where multiple base stations (BSs), connected to a central processor (CP) via capacity-limited fronthaul links, cooperatively serve communication users while simultaneously sensing a target. The primary objective is to minimize the total transmit power while meeting the signal-to-interference-plus-noise ratio (…

    Submitted 15 August, 2024; originally announced August 2024.

  32. arXiv:2408.08050  [pdf, other]

    cs.CV

    CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection

    Authors: Xunfa Lai, Zhiyu Yang, Jie Hu, Shengchuan Zhang, Liujuan Cao, Guannan Jiang, Zhiyu Wang, Songan Zhang, Rongrong Ji

    Abstract: Existing camouflaged object detection (COD) methods depend heavily on large-scale pixel-level annotations. However, acquiring such annotations is laborious due to the inherent camouflage characteristics of the objects. Semi-supervised learning offers a promising solution to this challenge. Yet, its application in COD is hindered by significant pseudo-label noise, both pixel-level and instance-level. W…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted to ECCV 2024

  33. arXiv:2408.08037  [pdf, other]

    physics.bio-ph cond-mat.stat-mech q-bio.MN

    Maximum entropy models for patterns of gene expression

    Authors: Camilla Sarra, Leopoldo Sarra, Luca Di Carlo, Trevor GrandPre, Yaojun Zhang, Curtis G. Callan Jr., William Bialek

    Abstract: New experimental methods make it possible to measure the expression levels of many genes, simultaneously, in snapshots from thousands or even millions of individual cells. Current approaches to analyze these experiments involve clustering or low-dimensional projections. Here we use the principle of maximum entropy to obtain a probabilistic description that captures the observed presence or absence…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 14 pages, 21 figures

  34. arXiv:2408.08034  [pdf, other]

    cs.NI

    Centralized Network Utility Maximization with Accelerated Gradient Method

    Authors: Ying Tian, Zhiliang Wang, Xia Yin, Xingang Shi, Jiahai Yang, Han Zhang

    Abstract: Network utility maximization (NUM) is a well-studied problem for network traffic management and resource allocation. Because of the inherent decentralization and complexity of networks, most researches develop decentralized NUM algorithms. In recent years, the Software Defined Networking (SDN) architecture has been widely used, especially in cloud networks and inter-datacenter networks managed by l…

    Submitted 15 August, 2024; originally announced August 2024.

    Journal ref: 2022 IEEE 30th International Conference on Network Protocols (ICNP), pp. 1-11

  35. arXiv:2408.08013  [pdf, other]

    cs.CV

    Adaptive Learning of Consistency and Inconsistency Information for Fake News Detection

    Authors: Aohan Li, Jiaxin Chen, Xin Liao, Dengyong Zhang

    Abstract: The rapid advancement of social media platforms has significantly reduced the cost of information dissemination, yet it has also led to a proliferation of fake news, posing a threat to societal trust and credibility. Most of fake news detection research focused on integrating text and image information to represent the consistency of multiple modes in news content, while paying less attention to i…

    Submitted 15 August, 2024; originally announced August 2024.

  36. arXiv:2408.07969  [pdf, ps, other]

    q-fin.MF

    The mean-variance portfolio selection based on the average and current profitability of the risky asset

    Authors: Yu Li, Yuhan Wu, Shuhua Zhang

    Abstract: We study the continuous-time pre-commitment mean-variance portfolio selection in a time-varying financial market. By introducing two indexes which respectively express the average profitability of the risky asset (AP) and the current profitability of the risky asset (CP), the optimal portfolio selection is represented by AP and CP. Furthermore, instead of the traditional maximum likelihood estimat…

    Submitted 15 August, 2024; originally announced August 2024.

  37. arXiv:2408.07967  [pdf, other]

    cs.CV

    FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering

    Authors: Guofeng Feng, Siyan Chen, Rong Fu, Zimu Liao, Yi Wang, Tao Liu, Zhilin Pei, Hengjie Li, Xingcheng Zhang, Bo Dai

    Abstract: This work introduces FlashGS, an open-source CUDA Python library, designed to facilitate the efficient differentiable rasterization of 3D Gaussian Splatting through algorithmic and kernel-level optimizations. FlashGS is developed based on the observations from a comprehensive analysis of the rendering process to enhance computational efficiency and bring the technique to wide adoption. The paper i…

    Submitted 15 August, 2024; originally announced August 2024.

  38. arXiv:2408.07960  [pdf, other]

    quant-ph

    Characterization of Intensity Correlation via Single-photon Detection in Quantum Key Distribution

    Authors: Tianyi Xing, Junxuan Liu, Likang Zhang, Min-Yan Wang, Yu-Huai Li, Ruiyin Liu, Qingquan Peng, Dongyang Wang, Yaxuan Wang, Hongwei Liu, Wei Li, Yuan Cao, Anqi Huang

    Abstract: One of the most significant vulnerabilities in the source unit of quantum key distribution (QKD) is the correlation between quantum states after modulation, which shall be characterized and evaluated for its practical security performance. In this work, we propose a methodology to characterize the intensity correlation according to the single-photon detection results in the measurement unit withou…

    Submitted 15 August, 2024; originally announced August 2024.

  39. arXiv:2408.07957  [pdf, other]

    cs.MM

    Joint Optimization of Buffer Delay and HARQ for Video Communications

    Authors: Baoping Cheng, Peng Lei, Xiaoyan Xie, Tao Fu, Yukun Zhang, Xiaoming Tao

    Abstract: To improve the quality of experience (QoE) in video communication over lossy networks, this paper presents a transmission method that jointly optimizes buffer delay and Hybrid Automatic Repeat request (HARQ), referred to as BD-HARQ. This method operates on packet group and employs dynamic buffer delay combined with HARQ strategy for transmission. By defining the QoE based on metrics such as buffer…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 6 pages, 5 figures

  40. arXiv:2408.07936  [pdf, other]

    quant-ph math-ph

    A quantum-classical hybrid algorithm with Ising model for the learning with errors problem

    Authors: Muxi Zheng, Jinfeng Zeng, Wentao Yang, Pei-Jie Chang, Bao Yan, Haoran Zhang, Min Wang, Shijie Wei, Gui-Lu Long

    Abstract: The Learning-With-Errors (LWE) problem is a crucial computational challenge with significant implications for post-quantum cryptography and computational learning theory. Here we propose a quantum-classical hybrid algorithm with Ising model (HAWI) to address the LWE problem. Our approach involves transforming the LWE problem into the Shortest Vector Problem (SVP), using variable qubits to encode l…

    Submitted 15 August, 2024; originally announced August 2024.

  41. arXiv:2408.07935  [pdf, ps, other]

    math.AP

    A new blowup criterion for the 3D barotropic compressible Navier-Stokes equations with vacuum

    Authors: Saiguo Xu, Yinghui Zhang

    Abstract: We investigate the blowup criterion of the barotropic compressible viscous fluids for the Cauchy problem, Dirichlet problem and Navier-slip boundary condition. The main novelty of this paper is two-fold: First, for the Cauchy problem and Dirichlet problem, we prove that a strong or smooth solution exists globally, provided that the vorticity of velocity satisfies Serrin's condition and the maximum…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 19 pages

  42. arXiv:2408.07931  [pdf, other]

    cs.CV cs.AI cs.RO eess.IV

    Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning

    Authors: Haofeng Liu, Erli Zhang, Junde Wu, Mingxuan Hong, Yueming Jin

    Abstract: Surgical video segmentation is a critical task in computer-assisted surgery and is vital for enhancing surgical quality and patient outcomes. Recently, the Segment Anything Model 2 (SAM2) framework has shown superior advancements in image and video segmentation. However, SAM2 struggles with efficiency due to the high computational demands of processing high-resolution images and complex and long-r…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 16 pages, 2 figures

  43. AIE: Auction Information Enhanced Framework for CTR Prediction in Online Advertising

    Authors: Yang Yang, Bo Chen, Chenxu Zhu, Menghui Zhu, Xinyi Dai, Huifeng Guo, Muyu Zhang, Zhenhua Dong, Ruiming Tang

    Abstract: Click-Through Rate (CTR) prediction is a fundamental technique for online advertising recommendation and the complex online competitive auction process also brings many difficulties to CTR optimization. Recent studies have shown that introducing posterior auction information contributes to the performance of CTR prediction. However, existing work doesn't fully capitalize on the benefits of auction…

    Submitted 14 August, 2024; originally announced August 2024.

  44. arXiv:2408.07891  [pdf, other]

    cs.CV cs.AI cs.LG

    Quantum-inspired Interpretable Deep Learning Architecture for Text Sentiment Analysis

    Authors: Bingyu Li, Da Zhang, Zhiyuan Zhao, Junyu Gao, Yuan Yuan

    Abstract: Text has become the predominant form of communication on social media, embedding a wealth of emotional nuances. Consequently, the extraction of emotional information from text is of paramount importance. Despite previous research making some progress, existing text sentiment analysis models still face challenges in integrating diverse semantic information and lack interpretability. To address thes…

    Submitted 14 August, 2024; originally announced August 2024.

  45. arXiv:2408.07887  [pdf, other]

    cond-mat.mtrl-sci

    Topological Charge Quadrupole Protected by Spin-Orbit U(1) Quasi-Symmetry in Antiferromagnet NdBiPt

    Authors: Ao Zhang, Xiaobing Chen, Jiayu Li, Pengfei Liu, Yuntian Liu, Qihang Liu

    Abstract: The interplay of symmetry and topology in crystal solids has given rise to various elementary excitations as quasiparticles. Among these, those with significant Berry-phase-related transport responses are of particular interest. Here, we predict a new type of quasiparticle called topological charge quadrupole (TCQ), which is analogous to a charge quadrupole but consists of two closely-packed pairs…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 6 pages, 4 figures

  46. arXiv:2408.07882  [pdf]

    cond-mat.mes-hall quant-ph

    Anomalous thermodiffusion, absolute negative mobility and reverse heat transport in a single quantum dot

    Authors: Yanchao Zhang, Xiaolong Lü

    Abstract: We investigate the steady-state transport characteristics of a quantum dot system consisting of a single energy level embedded between two reservoirs under the influence of both the temperature gradient and bias voltage. Within tailored parameter regimes, the system can exhibit three counterintuitive transport phenomena of anomalous thermodiffusion, absolute negative mobility and reverse heat tran…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 16 pages, 9 figures

  47. arXiv:2408.07869  [pdf, other]

    cs.LG

    A Systematic Evaluation of Generated Time Series and Their Effects in Self-Supervised Pretraining

    Authors: Audrey Der, Chin-Chia Michael Yeh, Xin Dai, Huiyuan Chen, Yan Zheng, Yujie Fan, Zhongfang Zhuang, Vivian Lai, Junpeng Wang, Liang Wang, Wei Zhang, Eamonn Keogh

    Abstract: Self-supervised Pretrained Models (PTMs) have demonstrated remarkable performance in computer vision and natural language processing tasks. These successes have prompted researchers to design PTMs for time series data. In our experiments, most self-supervised time series PTMs were surpassed by simple supervised models. We hypothesize this undesired phenomenon may be caused by data scarcity. In res…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: To appear in CIKM 2024 as a short paper; the version here is the self-contained version that includes the non-mandatory supplementary material available on the paper's companion website

  48. arXiv:2408.07820  [pdf, other]

    cs.NI cs.IT eess.SY

    Hybrid Semantic/Bit Communication Based Networking Problem Optimization

    Authors: Le Xia, Yao Sun, Dusit Niyato, Lan Zhang, Lei Zhang, Muhammad Ali Imran

    Abstract: Semantic communication (SemCom) has recently shown great potential in significant resource savings and efficient information exchanges, thus naturally introducing a novel and practical next-generation cellular network paradigm where two modes of SemCom and conventional bit communication (BitCom) coexist, namely hybrid semantic/bit communication network (HSB-Net). Nevertheless, the pertinent wirele…

    Submitted 30 July, 2024; originally announced August 2024.

    Comments: This paper has been accepted for publication in 2024 IEEE Global Communications Conference (GlobeCom 2024). Copyright may be transferred without notice, after which this version may no longer be accessible. arXiv admin note: substantial text overlap with arXiv:2404.04162

  49. arXiv:2408.07802  [pdf, other]

    cs.LG cs.DC

    Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference

    Authors: Rohan Baskar Prabhakar, Hengrui Zhang, David Wentzlaff

    Abstract: Large Transformer networks are increasingly used in settings where low inference latency can improve the end-user experience and enable new applications. However, autoregressive inference is resource intensive and requires parallelism for efficiency. Parallelism introduces collective communication that is both expensive and represents a phase when hardware resources are underutilized. Towards miti…

    Submitted 14 August, 2024; originally announced August 2024.

  50. arXiv:2408.07752  [pdf, other]

    quant-ph

    Hybrid entanglement and error correction in a scalable quantum network node

    Authors: Xiu-Ying Chang, Pan-Yu Hou, Wen-Gang Zhang, Xiang-Qian Meng, Ye-Fei Yu, Ya-Nan Lu, Yan-Qing Liu, Bin-Xiang Qi, Dong-Ling Deng, Lu-Ming Duan

    Abstract: Recent breakthroughs have ushered the quantum network into a new era, where quantum information can be stored, transferred, and processed across multiple nodes on a metropolitan scale. A key challenge in this new era is enhancing the capabilities of individual nodes, providing precise and robust control over multiple qubits and advanced functionality for scalable quantum networks. Here, we report…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 8 pages, 3 figures