Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 151–200 of 1,729 results for author: Yang, K

.
  1. arXiv:2403.05023  [pdf, other

    cs.CL cs.CV

    Towards Multimodal Sentiment Analysis Debiasing via Bias Purification

    Authors: Dingkang Yang, Mingcheng Li, Dongling Xiao, Yang Liu, Kun Yang, Zhaoyu Chen, Yuzheng Wang, Peng Zhai, Ke Li, Lihua Zhang

    Abstract: Multimodal Sentiment Analysis (MSA) aims to understand human intentions by integrating emotion-related clues from diverse modalities, such as visual, language, and audio. Unfortunately, the current MSA task invariably suffers from unplanned dataset biases, particularly multimodal utterance-level label bias and word-level context bias. These harmful biases potentially mislead models to focus on sta… ▽ More

    Submitted 5 July, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV 2024

  2. arXiv:2403.05007  [pdf, other

    eess.SY

    Age of Computing: A Metric of Computation Freshness in Communication and Computation Cooperative Networks

    Authors: Xingran Chen, Yi Zhuang, Kun Yang

    Abstract: In communication and computation cooperative networks (3CNs), timely computation is crucial but not always guaranteed. There is a strong demand for a computational task to be completed within a given deadline. The time taken involves both processing time, communication time, and the impact of the deadline. However, a measure of such timeliness in 3CNs is lacking. In this paper, we introduce the no… ▽ More

    Submitted 24 July, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

  3. arXiv:2403.03506  [pdf, other

    cs.CL cs.AI

    Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights

    Authors: Zijie Zeng, Shiqi Liu, Lele Sha, Zhuang Li, Kaixun Yang, Sannyuya Liu, Dragan Gašević, Guanliang Chen

    Abstract: This study explores the challenge of sentence-level AI-generated text detection within human-AI collaborative hybrid texts. Existing studies of AI-generated text detection for hybrid texts often rely on synthetic datasets. These typically involve hybrid texts with a limited number of boundaries. We contend that studies of detecting AI-generated content within hybrid texts should cover different ty… ▽ More

    Submitted 23 May, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Camera-Ready version of our IJCAI 2024 accepted paper (Special Track: AI and Social Good)

  4. arXiv:2403.03212  [pdf, other

    physics.ins-det hep-ex

    Performance of a modular ton-scale pixel-readout liquid argon time projection chamber

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

    Abstract: The Module-0 Demonstrator is a single-phase 600 kg liquid argon time projection chamber operated as a prototype for the DUNE liquid argon near detector. Based on the ArgonCube design concept, Module-0 features a novel 80k-channel pixelated charge readout and advanced high-coverage photon detection system. In this paper, we present an analysis of an eight-day data set consisting of 25 million cosmi… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 47 pages, 41 figures

    Report number: FERMILAB-PUB-24-0073-LBNF

  5. arXiv:2403.03004  [pdf, other

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  6. arXiv:2403.02814  [pdf, other

    cs.LG cs.AI

    InjectTST: A Transformer Method of Injecting Global Information into Independent Channels for Long Time Series Forecasting

    Authors: Ce Chi, Xing Wang, Kexin Yang, Zhiyan Song, Di Jin, Lin Zhu, Chao Deng, Junlan Feng

    Abstract: Transformer has become one of the most popular architectures for multivariate time series (MTS) forecasting. Recent Transformer-based MTS models generally prefer channel-independent structures with the observation that channel independence can alleviate noise and distribution drift issues, leading to more robustness. Nevertheless, it is essential to note that channel dependency remains an inherent… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  7. arXiv:2403.01789  [pdf, other

    cs.CR eess.SY

    DECOR: Enhancing Logic Locking Against Machine Learning-Based Attacks

    Authors: Yinghua Hu, Kaixin Yang, Subhajit Dutta Chowdhury, Pierluigi Nuzzo

    Abstract: Logic locking (LL) has gained attention as a promising intellectual property protection measure for integrated circuits. However, recent attacks, facilitated by machine learning (ML), have shown the potential to predict the correct key in multiple LL schemes by exploiting the correlation of the correct key value with the circuit structure. This paper presents a generic LL enhancement method based… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages. Accepted at the International Symposium on Quality Electronic Design (ISQED), 2024

  8. arXiv:2403.01738  [pdf, other

    cs.LG

    ComS2T: A complementary spatiotemporal learning system for data-adaptive model evolution

    Authors: Zhengyang Zhou, Qihe Huang, Binwu Wang, Jianpeng Hou, Kuo Yang, Yuxuan Liang, Yang Wang

    Abstract: Spatiotemporal (ST) learning has become a crucial technique to enable smart cities and sustainable urban development. Current ST learning models capture the heterogeneity via various spatial convolution and temporal evolution blocks. However, rapid urbanization leads to fluctuating distributions in urban data and city structures over short periods, resulting in existing methods suffering generaliz… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2403.01107  [pdf, other

    physics.optics physics.app-ph

    Octave-spanning Kerr soliton frequency combs in dispersion- and dissipation-engineered lithium niobate microresonators

    Authors: Yunxiang Song, Yaowen Hu, Xinrui Zhu, Kiyoul Yang, Marko Loncar

    Abstract: Dissipative Kerr solitons from optical microresonators, commonly referred to as soliton microcombs, have been developed for a broad range of applications, including precision measurement, optical frequency synthesis, and ultra-stable microwave and millimeter wave generation, all on a chip. An important goal for microcombs is self referencing, which requires octave-spanning bandwidths to detect and… ▽ More

    Submitted 25 May, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  10. arXiv:2403.00331  [pdf, other

    cs.DC

    WindGP: Efficient Graph Partitioning on Heterogenous Machines

    Authors: Li Zeng, Haohan Huang, Binfan Zheng, Kang Yang, Shengcheng Shao, Jinhua Zhou, Jun Xie, Rongqian Zhao, Xin Chen

    Abstract: Graph Partitioning is widely used in many real-world applications such as fraud detection and social network analysis, in order to enable the distributed graph computing on large graphs. However, existing works fail to balance the computation cost and communication cost on machines with different power (including computing capability, network bandwidth and memory size), as they only consider repli… ▽ More

    Submitted 6 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: 19 pages, 15 figures, 18 tables

  11. arXiv:2402.18393  [pdf, other

    cs.AI cs.NE cs.RO cs.SE

    Evaluating Decision Optimality of Autonomous Driving via Metamorphic Testing

    Authors: Mingfei Cheng, Yuan Zhou, Xiaofei Xie, Junjie Wang, Guozhu Meng, Kairui Yang

    Abstract: Autonomous Driving System (ADS) testing is crucial in ADS development, with the current primary focus being on safety. However, the evaluation of non-safety-critical performance, particularly the ADS's ability to make optimal decisions and produce optimal paths for autonomous vehicles (AVs), is equally vital to ensure the intelligence and reduce risks of AVs. Currently, there is little work dedica… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  12. arXiv:2402.18302  [pdf, other

    cs.CV cs.RO eess.AS eess.IV

    EchoTrack: Auditory Referring Multi-Object Tracking for Autonomous Driving

    Authors: Jiacheng Lin, Jiajun Chen, Kunyu Peng, Xuan He, Zhiyong Li, Rainer Stiefelhagen, Kailun Yang

    Abstract: This paper introduces the task of Auditory Referring Multi-Object Tracking (AR-MOT), which dynamically tracks specific objects in a video sequence based on audio expressions and appears as a challenging problem in autonomous driving. Due to the lack of semantic modeling capacity in audio and video, existing works have mainly focused on text-based multi-object tracking, which often comes at the cos… ▽ More

    Submitted 5 August, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted to IEEE Transactions on Intelligent Transportation Systems (T-ITS). The source code and datasets are available at https://github.com/lab206/EchoTrack

  13. arXiv:2402.15481  [pdf, other

    cs.CL cs.CY

    Prejudice and Volatility: A Statistical Framework for Measuring Social Discrimination in Large Language Models

    Authors: Y Liu, K Yang, Z Qi, X Liu, Y Yu, C Zhai

    Abstract: This study investigates why and how inconsistency in the generation of Large Language Models (LLMs) might induce or exacerbate societal injustice. For instance, LLMs frequently exhibit contrasting gender stereotypes regarding the same career depending on varied contexts, highlighting the arguably harmful unpredictability of LLMs' behavioral patterns. To augment the existing discrimination assessme… ▽ More

    Submitted 24 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  14. arXiv:2402.15169  [pdf, ps, other

    cs.GT cs.DS cs.MA

    Platforms for Efficient and Incentive-Aware Collaboration

    Authors: Nika Haghtalab, Mingda Qiao, Kunhe Yang

    Abstract: Collaboration is crucial for reaching collective goals. However, its effectiveness is often undermined by the strategic behavior of individual agents -- a fact that is captured by a high Price of Stability (PoS) in recent literature [Blum et al., 2021]. Implicit in the traditional PoS analysis is the assumption that agents have full knowledge of how their tasks relate to one another. We offer a ne… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

  15. arXiv:2402.14650  [pdf, other

    cs.CV

    GaussianPro: 3D Gaussian Splatting with Progressive Propagation

    Authors: Kai Cheng, Xiaoxiao Long, Kaizhi Yang, Yao Yao, Wei Yin, Yuexin Ma, Wenping Wang, Xuejin Chen

    Abstract: The advent of 3D Gaussian Splatting (3DGS) has recently brought about a revolution in the field of neural rendering, facilitating high-quality renderings at real-time speed. However, 3DGS heavily depends on the initialized point cloud produced by Structure-from-Motion (SfM) techniques. When tackling with large-scale scenes that unavoidably contain texture-less surfaces, the SfM techniques always f… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: See the project page for code, data: https://kcheng1021.github.io/gaussianpro.github.io

  16. arXiv:2402.13934  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Do Efficient Transformers Really Save Computation?

    Authors: Kai Yang, Jan Ackermann, Zhenyu He, Guhao Feng, Bohang Zhang, Yunzhen Feng, Qiwei Ye, Di He, Liwei Wang

    Abstract: As transformer-based language models are trained on increasingly large datasets and with vast numbers of parameters, finding more efficient alternatives to the standard Transformer has become very valuable. While many efficient Transformers and Transformer alternatives have been proposed, none provide theoretical guarantees that they are a suitable replacement for the standard Transformer. This ma… ▽ More

    Submitted 21 February, 2024; originally announced February 2024.

  17. arXiv:2402.12659  [pdf, other

    cs.CL cs.AI cs.CE

    FinBen: A Holistic Financial Benchmark for Large Language Models

    Authors: Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu , et al. (9 additional authors not shown)

    Abstract: LLMs have transformed NLP and shown promise in various fields, yet their potential in finance is underexplored due to a lack of comprehensive evaluation benchmarks, the rapid development of LLMs, and the complexity of financial tasks. In this paper, we introduce FinBen, the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks, covering seven critical… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 26 pages, 11 figures

  18. arXiv:2402.12620  [pdf, other

    cs.CY

    Are Large Language Models (LLMs) Good Social Predictors?

    Authors: Kaiqi Yang, Hang Li, Hongzhi Wen, Tai-Quan Peng, Jiliang Tang, Hui Liu

    Abstract: The prediction has served as a crucial scientific method in modern social studies. With the recent advancement of Large Language Models (LLMs), efforts have been made to leverage LLMs to predict the human features in social life, such as presidential voting. These works suggest that LLMs are capable of generating human-like responses. However, we find that the promising performance achieved by pre… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  19. arXiv:2402.11669  [pdf, other

    physics.optics physics.app-ph

    Hybrid Kerr-electro-optic frequency combs on thin-film lithium niobate

    Authors: Yunxiang Song, Yaowen Hu, Marko Lončar, Kiyoul Yang

    Abstract: Optical frequency combs are indispensable links between the optical and microwave domains, enabling a wide range of applications including precision spectroscopy, ultrastable frequency generation, and timekeeping. Chip-scale integration miniaturizes bulk implementations onto photonic chips, offering highly compact, stable, and power-efficient frequency comb sources. State of the art integrated fre… ▽ More

    Submitted 18 February, 2024; originally announced February 2024.

  20. arXiv:2402.11060  [pdf, other

    cs.CL cs.AI cs.IR

    Persona-DB: Efficient Large Language Model Personalization for Response Prediction with Collaborative Data Refinement

    Authors: Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi R. Fung, Hou Pong Chan, Kevin Small, ChengXiang Zhai, Heng Ji

    Abstract: The increasing demand for personalized interactions with large language models (LLMs) calls for methodologies capable of accurately and efficiently identifying user opinions and preferences. Retrieval augmentation emerges as an effective strategy, as it can accommodate a vast number of users without the costs from fine-tuning. Existing research, however, has largely focused on enhancing the retrie… ▽ More

    Submitted 20 August, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  21. arXiv:2402.10197  [pdf, ps, other

    math.PR math-ph

    Bulk universality for complex eigenvalues of real non-symmetric random matrices with i.i.d. entries

    Authors: Sofiia Dubova, Kevin Yang

    Abstract: We consider an ensemble of non-Hermitian matrices with independent identically distributed real entries that have finite moments. We show that its $k$-point correlation function in the bulk away from the real line converges to a universal limit.

    Submitted 26 April, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: 67 pages, revised version, updated references

    MSC Class: 60B20; 15B52

  22. arXiv:2402.09723  [pdf, other

    stat.ML cs.AI cs.CL cs.LG

    Efficient Prompt Optimization Through the Lens of Best Arm Identification

    Authors: Chengshuai Shi, Kun Yang, Zihan Chen, Jundong Li, Jing Yang, Cong Shen

    Abstract: The remarkable instruction-following capability of large language models (LLMs) has sparked a growing interest in automatically finding good prompts, i.e., prompt optimization. Most existing works follow the scheme of selecting from a pre-generated pool of candidate prompts. However, these designs mainly focus on the generation strategy, while limited attention has been paid to the selection metho… ▽ More

    Submitted 30 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  23. arXiv:2402.07747  [pdf, ps, other

    math.ST stat.ML

    Optimal score estimation via empirical Bayes smoothing

    Authors: Andre Wibisono, Yihong Wu, Kaylee Yingxi Yang

    Abstract: We study the problem of estimating the score function of an unknown probability distribution $ρ^*$ from $n$ independent and identically distributed observations in $d$ dimensions. Assuming that $ρ^*$ is subgaussian and has a Lipschitz-continuous score function $s^*$, we establish the optimal rate of $\tilde Θ(n^{-\frac{2}{d+4}})$ for this estimation problem under the loss function… ▽ More

    Submitted 12 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: COLT 2024; added the new results on extending to beta-Holder scores with beta <= 1

  24. arXiv:2402.07426  [pdf, ps, other

    cs.GT

    Computational Aspects of Bayesian Persuasion under Approximate Best Response

    Authors: Kunhe Yang, Hanrui Zhang

    Abstract: We study Bayesian persuasion under approximate best response, where the receiver may choose any action that is not too much suboptimal given their posterior belief upon receiving the signal. We focus on the computational aspects of the problem, aiming to design algorithms that efficiently compute (almost) optimal strategies for the sender. Despite the absence of the revelation principle -- which h… ▽ More

    Submitted 13 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  25. arXiv:2402.06299  [pdf, other

    cs.NE cs.AI

    A Functional Analysis Approach to Symbolic Regression

    Authors: Kirill Antonov, Roman Kalkreuth, Kaifeng Yang, Thomas Bäck, Niki van Stein, Anna V Kononova

    Abstract: Symbolic regression (SR) poses a significant challenge for randomized search heuristics due to its reliance on the synthesis of expressions for input-output mappings. Although traditional genetic programming (GP) algorithms have achieved success in various domains, they exhibit limited performance when tree-based representations are used for SR. To address these limitations, we introduce a novel S… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 14 pages, 3 figures. Submitted to Genetic and Evolutionary Computation Conference (GECCO-2024)

  26. arXiv:2402.04409  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Towards Fair, Robust and Efficient Client Contribution Evaluation in Federated Learning

    Authors: Meiying Zhang, Huan Zhao, Sheldon Ebron, Kan Yang

    Abstract: The performance of clients in Federated Learning (FL) can vary due to various reasons. Assessing the contributions of each client is crucial for client selection and compensation. It is challenging because clients often have non-independent and identically distributed (non-iid) data, leading to potentially noisy or divergent updates. The risk of malicious clients amplifies the challenge especially… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  27. arXiv:2402.04303  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Broken Symmetry in Ideal Chern Bands

    Authors: Hui Liu, Kang Yang, Ahmed Abouelkomsan, Zhao Liu, Emil J. Bergholtz

    Abstract: Recent observations of the fractional anomalous quantum Hall effect in moiré materials have reignited the interest in fractional Chern insulators (FCIs). The chiral limit in which analytic Landau level-like single-particle states form an "ideal" Chern band and local interactions lead to Laughlin-like FCIs at $1/3$ filling, has been very useful for understanding these systems by relating them to th… ▽ More

    Submitted 26 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 5 pages + 4 figures, comments are welcome!

  28. "Life" of dust originating from the irregular satellites of Jupiter

    Authors: Zhenghan Chen, Kun Yang, Xiaodong Liu

    Abstract: The irregular satellites of Jupiter produce dust particles through the impact of interplanetary micrometeoroids. In this paper, the dynamics of these particles is studied by both high-accuracy numerical simulation and analytical theory, in order to learn their transport, final fate, and spatial distribution. The perturbation forces that are considered in our dynamical model include the solar radia… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 11 pages, 14 figures

    Journal ref: Monthly Notices of the Royal Astronomical Society, 2024, 527(4): 11327-11337

  29. arXiv:2402.02916  [pdf, ps, other

    math.AP

    On bilinear Strichartz estimates on waveguides with applications

    Authors: Yangkendi Deng, Chenjie Fan, Kailong Yang, Zehua Zhao, Jiqiang Zheng

    Abstract: We study local-in-time and global-in-time bilinear Strichartz estimates for the Schrödinger equation on waveguides. As applications, we apply those estimates to study global well-posedness of nonlinear Schrödinger equations on these waveguides.

    Submitted 29 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 23 pages, 2 figures

  30. arXiv:2402.01728  [pdf, other

    cs.CL cs.AI cs.AR

    Hardware Phi-1.5B: A Large Language Model Encodes Hardware Domain Specific Knowledge

    Authors: Weimin Fu, Shijie Li, Yifang Zhao, Haocheng Ma, Raj Dutta, Xuan Zhang, Kaichen Yang, Yier Jin, Xiaolong Guo

    Abstract: In the rapidly evolving semiconductor industry, where research, design, verification, and manufacturing are intricately linked, the potential of Large Language Models to revolutionize hardware design and security verification is immense. The primary challenge, however, lies in the complexity of hardware specific issues that are not adequately addressed by the natural language or software code know… ▽ More

    Submitted 27 January, 2024; originally announced February 2024.

    Comments: 6 pages, 6 figures

    Journal ref: 29th IEEE/ACM Asia and South Pacific Design Automation Conference (ASP-DAC); 2024 January; Incheon Songdo Convensia, South Korea

  31. arXiv:2402.01568  [pdf, other

    physics.ins-det

    Doping Liquid Argon with Xenon in ProtoDUNE Single-Phase: Effects on Scintillation Light

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, H. Amar Es-sghir, P. Amedo, J. Anderson, D. A. Andrade, C. Andreopoulos , et al. (1297 additional authors not shown)

    Abstract: Doping of liquid argon TPCs (LArTPCs) with a small concentration of xenon is a technique for light-shifting and facilitates the detection of the liquid argon scintillation light. In this paper, we present the results of the first doping test ever performed in a kiloton-scale LArTPC. From February to May 2020, we carried out this special run in the single-phase DUNE Far Detector prototype (ProtoDUN… ▽ More

    Submitted 2 August, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: 36 pages, 20 figures. Corrected author list; corrected typos across paper and polished text

    Report number: CERN-EP-2024-024; FERMILAB-PUB-23-0819-LBNF

  32. arXiv:2402.00744  [pdf, other

    cs.SD cs.CL eess.AS

    BATON: Aligning Text-to-Audio Model with Human Preference Feedback

    Authors: Huan Liao, Haonan Han, Kai Yang, Tianjiao Du, Rui Yang, Zunnan Xu, Qinmei Xu, Jingquan Liu, Jiasheng Lu, Xiu Li

    Abstract: With the development of AI-Generated Content (AIGC), text-to-audio models are gaining widespread attention. However, it is challenging for these models to generate audio aligned with human preference due to the inherent information density of natural language and limited model understanding ability. To alleviate this issue, we formulate the BATON, a framework designed to enhance the alignment betw… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  33. arXiv:2401.17837  [pdf, ps, other

    eess.SY

    Safe Reinforcement Learning-Based Eco-Driving Control for Mixed Traffic Flows With Disturbances

    Authors: Ke Lu, Dongjun Li, Qun Wang, Kaidi Yang, Lin Zhao, Ziyou Song

    Abstract: This paper presents a safe learning-based eco-driving framework tailored for mixed traffic flows, which aims to optimize energy efficiency while guaranteeing safety during real-system operations. Even though reinforcement learning (RL) is capable of optimizing energy efficiency in intricate environments, it is challenged by safety requirements during the training process. The lack of safety guaran… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  34. arXiv:2401.16923  [pdf, other

    cs.CV cs.RO eess.IV

    Fourier Prompt Tuning for Modality-Incomplete Scene Segmentation

    Authors: Ruiping Liu, Jiaming Zhang, Kunyu Peng, Yufan Chen, Ke Cao, Junwei Zheng, M. Saquib Sarfraz, Kailun Yang, Rainer Stiefelhagen

    Abstract: Integrating information from multiple modalities enhances the robustness of scene perception systems in autonomous vehicles, providing a more comprehensive and reliable sensory framework. However, the modality incompleteness in multi-modal segmentation remains under-explored. In this work, we establish a task called Modality-Incomplete Scene Segmentation (MISS), which encompasses both system-level… ▽ More

    Submitted 10 April, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to IEEE IV 2024. The source code is publicly available at https://github.com/RuipingL/MISS

  35. arXiv:2401.16712  [pdf, other

    cs.CV cs.RO eess.IV

    LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

    Authors: Fei Teng, Jiaming Zhang, Jiawei Liu, Kunyu Peng, Xina Cheng, Zhiyong Li, Kailun Yang

    Abstract: Leveraging rich information is crucial for dense prediction tasks. Light field (LF) cameras are instrumental in this regard, as they allow data to be sampled from various perspectives. This capability provides valuable spatial, depth, and angular information, enhancing scene-parsing tasks. However, we have identified two overlooked issues for the LF salient object detection (SOD) task. (1): Previo… ▽ More

    Submitted 26 August, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted to ICPR 2024. The source code is publicly available at: https://github.com/FeiBryantkit/LF-Tracy

  36. arXiv:2401.16700  [pdf, other

    cs.CV cs.RO eess.IV

    Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers

    Authors: Jianbin Jiao, Xina Cheng, Weijie Chen, Xiaoting Yin, Hao Shi, Kailun Yang

    Abstract: 3D human pose estimation captures the human joint points in three-dimensional space while keeping the depth information and physical structure. That is essential for applications that require precise pose information, such as human-computer interaction, scene understanding, and rehabilitation training. Due to the challenges in data collection, mainstream datasets of 3D human pose estimation are pr… ▽ More

    Submitted 25 March, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted to IJCNN 2024. The source code will be available at https://github.com/WUJINHUAN/3D-human-pose

  37. LLM4SecHW: Leveraging Domain Specific Large Language Model for Hardware Debugging

    Authors: Weimin Fu, Kaichen Yang, Raj Gautam Dutta, Xiaolong Guo, Gang Qu

    Abstract: This paper presents LLM4SecHW, a novel framework for hardware debugging that leverages domain specific Large Language Model (LLM). Despite the success of LLMs in automating various software development tasks, their application in the hardware security domain has been limited due to the constraints of commercial LLMs and the scarcity of domain specific data. To address these challenges, we propose… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: 6 pages. 1 figure

    Journal ref: 2023 Asian Hardware Oriented Security and Trust Symposium (AsianHOST), Tianjin, China, 2023, pp. 1-6

  38. arXiv:2401.16421  [pdf, other

    cs.LG cs.AI cs.CL stat.ML

    Two Stones Hit One Bird: Bilevel Positional Encoding for Better Length Extrapolation

    Authors: Zhenyu He, Guhao Feng, Shengjie Luo, Kai Yang, Liwei Wang, Jingjing Xu, Zhi Zhang, Hongxia Yang, Di He

    Abstract: In this work, we leverage the intrinsic segmentation of language sequences and design a new positional encoding method called Bilevel Positional Encoding (BiPE). For each position, our BiPE blends an intra-segment encoding and an inter-segment encoding. The intra-segment encoding identifies the locations within a segment and helps the model capture the semantic information therein via absolute pos… ▽ More

    Submitted 17 June, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: 17 pages, 7 figures, 8 tables; ICML 2024 Camera Ready version; Code: https://github.com/zhenyuhe00/BiPE

  39. Superexchange interactions and magnetic anisotropy in MnPSe$_3$ monolayer

    Authors: Guangyu Wang, Ke Yang, Yaozhenghang Ma, Lu Liu, Di Lu, Yuxuan Zhou, Hua Wu

    Abstract: Two-dimensional van der Waals magnetic materials are of great current interest for their promising applications in spintronics. In this work, using density functional theory calculations in combination with the maximally localized Wannier functions method and the magnetic anisotropy analyses, we study the electronic and magnetic properties of MnPSe$_3$ monolayer. Our results show that it is a char… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 8 pages, 9 figures

    Journal ref: Chinese Phys. Lett. 40 077301 (2023)

  40. arXiv:2401.15561  [pdf, other

    eess.SY cs.RO

    A Parameter Privacy-Preserving Strategy for Mixed-Autonomy Platoon Control

    Authors: Jingyuan Zhou, Kaidi Yang

    Abstract: It has been demonstrated that leading cruise control (LCC) can improve the operation of mixed-autonomy platoons by allowing connected and automated vehicles (CAVs) to make longitudinal control decisions based on the information provided by surrounding vehicles. However, LCC generally requires surrounding human-driven vehicles (HDVs) to share their real-time states, which can be used by adversaries… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  41. arXiv:2401.12888  [pdf, other

    cs.RO cs.CV

    Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Mining, and Closed-Loop Technologies

    Authors: Lincan Li, Wei Shao, Wei Dong, Yijun Tian, Qiming Zhang, Kaixiang Yang, Wenjie Zhang

    Abstract: The aspiration of the next generation's autonomous driving (AD) technology relies on the dedicated integration and interaction among intelligent perception, prediction, planning, and low-level control. There has been a huge bottleneck regarding the upper bound of autonomous driving algorithm performance, a consensus from academia and industry believes that the key to surmount the bottleneck lies i… ▽ More

    Submitted 26 January, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  42. arXiv:2401.12852  [pdf, other

    cs.RO

    Control-Aware Trajectory Predictions for Communication-Efficient Drone Swarm Coordination in Cluttered Environments

    Authors: Longhao Yan, Jingyuan Zhou, Kaidi Yang

    Abstract: Swarms of Unmanned Aerial Vehicles (UAV) have demonstrated enormous potential in many industrial and commercial applications. However, before deploying UAVs in the real world, it is essential to ensure they can operate safely in complex environments, especially with limited communication capabilities. To address this challenge, we propose a control-aware learning-based trajectory prediction algori… ▽ More

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 15 pages, 15 figures, submitted to IEEE Transactions on Intelligent Vehicles

    ACM Class: I.2.9

  43. arXiv:2401.11836  [pdf, other

    cs.LG cs.CR eess.SY

    Privacy-Preserving Data Fusion for Traffic State Estimation: A Vertical Federated Learning Approach

    Authors: Qiqing Wang, Kaidi Yang

    Abstract: This paper proposes a privacy-preserving data fusion method for traffic state estimation (TSE). Unlike existing works that assume all data sources to be accessible by a single trusted party, we explicitly address data privacy concerns that arise in the collaboration and data sharing between multiple data owners, such as municipal authorities (MAs) and mobility providers (MPs). To this end, we prop… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  44. arXiv:2401.11409  [pdf, other

    cs.IT eess.SP

    Robust Beamforming for Downlink Multi-Cell Systems: A Bilevel Optimization Perspective

    Authors: Xingdi Chen, Yu Xiong, Kai Yang

    Abstract: Utilization of inter-base station cooperation for information processing has shown great potential in enhancing the overall quality of communication services (QoS) in wireless communication networks. Nevertheless, such cooperations require the knowledge of channel state information (CSI) at base stations (BSs), which is assumed to be perfectly known. However, CSI errors are inevitable in practice… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

    Comments: accepted at AAAI2024

  45. Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning

    Authors: Jingyuan Zhou, Longhao Yan, Kaidi Yang

    Abstract: Connected and automated vehicles (CAVs) have recently gained prominence in traffic research due to advances in communication technology and autonomous driving. Various longitudinal control strategies for CAVs have been developed to enhance traffic efficiency, stability, and safety in mixed-autonomy scenarios. Deep reinforcement learning (DRL) is one promising strategy for mixed-autonomy platoon co… ▽ More

    Submitted 1 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: IEEE Transactions on Intelligent Vehicles (2024)

  46. arXiv:2401.09793  [pdf, other

    cs.LG

    PatchAD: A Lightweight Patch-based MLP-Mixer for Time Series Anomaly Detection

    Authors: Zhijie Zhong, Zhiwen Yu, Yiyuan Yang, Weizheng Wang, Kaixiang Yang

    Abstract: Anomaly detection in time series analysis is a pivotal task, yet it poses the challenge of discerning normal and abnormal patterns in label-deficient scenarios. While prior studies have largely employed reconstruction-based approaches, which limits the models' representational capacities. Moreover, existing deep learning-based methods are not sufficiently lightweight. Addressing these issues, we p… ▽ More

    Submitted 28 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 22 pages, 11 figures, 14 tables, Under review

  47. arXiv:2401.09750  [pdf, other

    cs.LG

    Exploration and Anti-Exploration with Distributional Random Network Distillation

    Authors: Kai Yang, Jian Tao, Jiafei Lyu, Xiu Li

    Abstract: Exploration remains a critical issue in deep reinforcement learning for an agent to attain high returns in unknown environments. Although the prevailing exploration Random Network Distillation (RND) algorithm has been demonstrated to be effective in numerous environments, it often needs more discriminative power in bonus allocation. This paper highlights the "bonus inconsistency" issue within RND,… ▽ More

    Submitted 19 May, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: ICML 2024 accepted

  48. arXiv:2401.09549  [pdf, other

    cond-mat.mes-hall

    Interferometric Single-Shot Parity Measurement in an InAs-Al Hybrid Device

    Authors: Morteza Aghaee, Alejandro Alcaraz Ramirez, Zulfi Alam, Rizwan Ali, Mariusz Andrzejczuk, Andrey Antipov, Mikhail Astafev, Amin Barzegar, Bela Bauer, Jonathan Becker, Umesh Kumar Bhaskar, Alex Bocharov, Srini Boddapati, David Bohn, Jouri Bommer, Leo Bourdet, Arnaud Bousquet, Samuel Boutin, Lucas Casparis, Benjamin James Chapman, Sohail Chatoor, Anna Wulff Christensen, Cassandra Chua, Patrick Codd, William Cole , et al. (137 additional authors not shown)

    Abstract: The fusion of non-Abelian anyons or topological defects is a fundamental operation in measurement-only topological quantum computation. In topological superconductors, this operation amounts to a determination of the shared fermion parity of Majorana zero modes. As a step towards this, we implement a single-shot interferometric measurement of fermion parity in indium arsenide-aluminum heterostruct… ▽ More

    Submitted 2 April, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: Added data on a second measurement of device A and a measurement of device B, expanded discussion of a trivial scenario. Refs added, author list updated

  49. EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis

    Authors: Zhiwei Liu, Kailai Yang, Tianlin Zhang, Qianqian Xie, Sophia Ananiadou

    Abstract: Sentiment analysis and emotion detection are important research topics in natural language processing (NLP) and benefit many downstream tasks. With the widespread application of LLMs, researchers have started exploring the application of LLMs based on instruction-tuning in the field of sentiment analysis. However, these models only focus on single aspects of affective classification tasks (e.g. se… ▽ More

    Submitted 17 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by KDD 2024

  50. arXiv:2401.08207  [pdf, other

    astro-ph.HE astro-ph.GA

    AGN jet-inflated bubbles as possible origin of odd radio circles

    Authors: Yen-Hsing Lin, H. -Y. Karen Yang

    Abstract: Odd radio circles (ORCs) are newly discovered extragalactic radio objects with unknown origin. In this work, we carry out three-dimensional cosmic-ray (CR) magnetohydrodynamic simulations using the FLASH code and predict the radio morphology of end-on active galactic nucleus (AGN) jet-inflated bubbles considering hadronic emission. We consider CR proton (CRp)-dominated jets as they tend to inflate… ▽ More

    Submitted 21 August, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 17 pages, 10 figures, ApJ accepted