Showing 1–50 of 770 results for author: Zheng, L

  1. arXiv:2407.10431  [pdf, other]

    astro-ph.HE astro-ph.IM gr-qc

    Coport: A New Public Code for Polarized Radiative Transfer in a Covariant Framework$^\spadesuit$

    Authors: Jiewei Huang, Liheng Zheng, Minyong Guo, Bin Chen

    Abstract: General relativistic radiative transfer calculations are essential for comparing theoretical models of black hole accretion flows and jets with observational data. In this work, we introduce Coport, a novel public code specifically designed for covariant polarized ray-tracing radiative transfer computations in any spacetime. Written in Julia, Coport includes an interface for visualizing numerical…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 27 pages, 6 figures;

  2. arXiv:2407.09466  [pdf, other]

    cs.RO cs.GR

    TRAVERSE: Traffic-Responsive Autonomous Vehicle Experience & Rare-event Simulation for Enhanced safety

    Authors: Sandeep Thalapanane, Sandip Sharan Senthil Kumar, Guru Nandhan Appiya Dilipkumar Peethambari, Sourang SriHari, Laura Zheng, Julio Poveda, Ming C. Lin

    Abstract: Data for training learning-enabled self-driving cars in the physical world are typically collected in a safe, normal environment. Such data distribution often engenders a strong bias towards safe driving, making self-driving cars unprepared when encountering adversarial scenarios like unexpected accidents. Due to a dearth of such adverse data that is unrealistic for drivers to collect, autonomous…

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2407.08981  [pdf]

    eess.SY

    Joint Load and Capacity Scheduling for Flexible Radio Resource Management of High-Throughput Satellites

    Authors: Jia Zhuoya, Xiong Wei, Hao Hongxing, Liu Zheng, Han Chi

    Abstract: This work first explores using flexible beam-user mapping to optimize the beam service range and beam position, in order to adapt the non-uniform traffic demand to offer in high-throughput satellite (HTS) systems. Second, on this basis, the joint flexible bandwidth allocation is adopted to adapt the offer to demand at the same time. This strategy allows both beam capacity and load to be adjusted t…

    Submitted 10 July, 2024; originally announced July 2024.

  4. arXiv:2407.08529  [pdf, other]

    cs.CR

    Enhancing Privacy of Spatiotemporal Federated Learning against Gradient Inversion Attacks

    Authors: Lele Zheng, Yang Cao, Renhe Jiang, Kenjiro Taura, Yulong Shen, Sheng Li, Masatoshi Yoshikawa

    Abstract: Spatiotemporal federated learning has recently raised intensive studies due to its ability to train valuable models with only shared gradients in various location-based services. On the other hand, recent studies have shown that shared gradients may be subject to gradient inversion attacks (GIA) on images or texts. However, so far there has not been any systematic study of the gradient inversion a…

    Submitted 15 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by DASFAA 2024, 16 pages

  5. arXiv:2407.08085  [pdf, other]

    hep-ex astro-ph.CO physics.ins-det

    Light Dark Matter Constraints from SuperCDMS HVeV Detectors Operated Underground with an Anticoincidence Event Selection

    Authors: SuperCDMS Collaboration, M. F. Albakry, I. Alkhatib, D. Alonso-González, D. W. P. Amaral, J. Anczarski, T. Aralis, T. Aramaki, I. J. Arnquist, I. Ataee Langroudy, E. Azadbakht, C. Bathurst, R. Bhattacharyya, A. J. Biffl, P. L. Brink, M. Buchanan, R. Bunker, B. Cabrera, R. Calkins, R. A. Cameron, C. Cartaro, D. G. Cerdeño, Y. -Y. Chang, M. Chaudhuri, J. -H. Chen , et al. (116 additional authors not shown)

    Abstract: This article presents constraints on dark-matter-electron interactions obtained from the first underground data-taking campaign with multiple SuperCDMS HVeV detectors operated in the same housing. An exposure of 7.63 g-days is used to set upper limits on the dark-matter-electron scattering cross section for dark matter masses between 0.5 and 1000 MeV/$c^2$, as well as upper limits on dark photon k…

    Submitted 12 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 7 pages + title and references, 4 figures, and 1 table

  6. arXiv:2407.07443  [pdf, other]

    cs.AI

    Secondary Structure-Guided Novel Protein Sequence Generation with Latent Graph Diffusion

    Authors: Yutong Hu, Yang Tan, Andi Han, Lirong Zheng, Liang Hong, Bingxin Zhou

    Abstract: The advent of deep learning has introduced efficient approaches for de novo protein sequence design, significantly improving success rates and reducing development costs compared to computational or experimental methods. However, existing methods face challenges in generating proteins with diverse lengths and shapes while maintaining key structural features. To address these challenges, we introdu…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

  7. arXiv:2407.06951  [pdf, other]

    cs.RO

    RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios

    Authors: Liming Zheng, Feng Yan, Fanfan Liu, Chengjian Feng, Zhuoliang Kang, Lin Ma

    Abstract: Foundation models hold significant potential for enabling robots to perform long-horizon general manipulation tasks. However, the simplicity of tasks and the uniformity of environments in existing benchmarks restrict their effective deployment in complex scenarios. To address this limitation, this paper introduces the \textit{RoboCAS} benchmark, the first benchmark specifically designed for comple…

    Submitted 9 July, 2024; originally announced July 2024.

  8. arXiv:2407.06595  [pdf, ps, other]

    math.GR

    Classifying prime graphs of finite groups -- a methodical approach

    Authors: Thomas Michael Keller, Gavin Pettigrew, Saskia Solotko, Lixin Zheng

    Abstract: For a finite group $G$, the vertices of the prime graph $Γ(G)$ are the primes that divide $|G|$, and two vertices $p$ and $q$ are connected by an edge if and only if there is an element of order $pq$ in $G$. Prime graphs of solvable groups as well as groups whose noncyclic composition factors have order divisible by exactly three distinct primes have been classified in graph-theoretic terms. In th…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 40 pages, 4 figures

    MSC Class: 20D60 (primary) 05C25 (secondary)

  9. arXiv:2407.03816  [pdf]

    physics.optics

    Compact ultra-broadband light coupling on chip via nonadiabatic pumping

    Authors: Weiwei Liu, Chijun Li, Bing Wang, Tianyan Chai, Lingzhi Zheng, Zhuoxiong Liu, Haoru Zhang, Shuaifei Ren, Xiaohong Li, Cheng Zeng, Jinsong Xia, Peixiang Lu

    Abstract: Enlarging bandwidth capacity of the integrated photonic systems demands efficient and broadband light coupling among optical elements, which has been a vital issue in integrated photonics. Here, we have developed a compact ultra-broadband light coupling strategy based on nonadiabatic pumping in coupled optical waveguides, and experimentally demonstrated the designs in thin-film lithium niobate on…

    Submitted 4 July, 2024; originally announced July 2024.

  10. arXiv:2407.02124  [pdf]

    eess.SY

    Data-Driven Subsynchronous Oscillation Suppression for Renewable Energy Integrated Power Systems Based on Koopman Operator

    Authors: Zihan Wang, Ziyang Huang, Xiaonan Zhang, Gengyin Li, Le Zheng

    Abstract: Recently, subsynchronous oscillations (SSOs) have emerged frequently worldwide, with the high penetration of renewable power generation in modern power systems. The SSO introduced by renewables has become a prominent new stability problem, seriously threatening the stable operation of systems. This paper proposes a data-driven dynamic optimal controller for renewable energy integrated power system…

    Submitted 2 July, 2024; originally announced July 2024.

  11. arXiv:2407.01601  [pdf, other]

    cs.LG cs.AI

    Unveiling and Controlling Anomalous Attention Distribution in Transformers

    Authors: Ruiqing Yan, Xingbo Du, Haoyu Deng, Linghan Zheng, Qiuzhuang Sun, Jifang Hu, Yuhang Shao, Penghao Jiang, Jinrong Jiang, Lian Zhao

    Abstract: With the advent of large models based on the Transformer architecture, researchers have observed an anomalous phenomenon in the Attention mechanism--there is a very high attention on the first element, which is prevalent across Transformer-based models. It is crucial to understand it for the development of techniques focusing on attention distribution, such as Key-Value (KV) Cache compression and…

    Submitted 3 July, 2024; v1 submitted 26 June, 2024; originally announced July 2024.

  12. arXiv:2406.19755  [pdf, other]

    q-bio.QM cs.AI

    Protein Representation Learning with Sequence Information Embedding: Does it Always Lead to a Better Performance?

    Authors: Yang Tan, Lirong Zheng, Bozitao Zhong, Liang Hong, Bingxin Zhou

    Abstract: Deep learning has become a crucial tool in studying proteins. While the significance of modeling protein structure has been discussed extensively in the literature, amino acid types are typically included in the input as a default operation for many inference tasks. This study demonstrates with structure alignment task that embedding amino acid types in some cases may not help a deep learning mode…

    Submitted 28 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

  13. arXiv:2406.18977  [pdf, other]

    cs.RO cs.CL cs.CV

    RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulaiton

    Authors: Fanfan Liu, Feng Yan, Liming Zheng, Chengjian Feng, Yiyang Huang, Lin Ma

    Abstract: Utilizing Vision-Language Models (VLMs) for robotic manipulation represents a novel paradigm, aiming to enhance the model's ability to generalize to new objects and instructions. However, due to variations in camera specifications and mounting positions, existing methods exhibit significant performance disparities across different robotic platforms. To address this challenge, we propose RoboUniVie…

    Submitted 12 July, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

  14. arXiv:2406.17340  [pdf, other]

    astro-ph.SR

    Two Dynamically Discovered Compact Object Candidate Binary Systems from LAMOST Low-resolution Survey

    Authors: Senyu Qi, Wei-Min Gu, Zhi-Xiang Zhang, Tuan Yi, Jin-Zhong Liu, Ling-Lin Zheng

    Abstract: We report two binary systems, LAMOST J035540+381550 (hereafter J035540) and LAMOST J035916+400732 (hereafter J035916), identified through the Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) low-resolution survey (LRS). Each of these two systems contains an M-type star orbiting with a invisible compact object candidate. Follow-up spectroscopic observations of Palomar 200-inch tel…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 12 pages, 9 figures, accepted for publication in MNRAS

  15. arXiv:2406.16868  [pdf, other]

    eess.SP cs.AI

    Neural Network-based Two-Dimensional Filtering for OTFS Symbol Detection

    Authors: Jiarui Xu, Karim Said, Lizhong Zheng, Lingjia Liu

    Abstract: Orthogonal time frequency space (OTFS) is a promising modulation scheme for wireless communication in high-mobility scenarios. Recently, a reservoir computing (RC) based approach has been introduced for online subframe-based symbol detection in the OTFS system, where only the limited over-the-air (OTA) pilot symbols are utilized for training. However, the previous RC-based approach does not design…

    Submitted 8 March, 2024; originally announced June 2024.

    Comments: 6 pages, conference paper. arXiv admin note: substantial text overlap with arXiv:2311.08543

  16. arXiv:2406.16711  [pdf]

    eess.SY

    Generalized Modal Analysis in Power System with High CIG Penetration: Concept and Quantitative Assessment

    Authors: Le Zheng, Jiajie Zheng, Chongru Liu

    Abstract: This paper presents a Generalized Modal Analysis (GMA) concept for the small-signal stability analysis of power systems with high penetration of Converter-Interfaced Generation (CIG). GMA quantitatively assesses interactions between various elements in the power system, offering intuitive and transparent physical interpretations. The method's versatility in selecting physical quantities at differe…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: submitted to IEEE Transactions on Power Systems for peer-review

  17. arXiv:2406.13833  [pdf, other]

    stat.ME stat.ML

    Cluster Quilting: Spectral Clustering for Patchwork Learning

    Authors: Lili Zheng, Andersen Chang, Genevera I. Allen

    Abstract: Patchwork learning arises as a new and challenging data collection paradigm where both samples and features are observed in fragmented subsets. Due to technological limits, measurement expense, or multimodal data integration, such patchwork data structures are frequently seen in neuroscience, healthcare, and genomics, among others. Instead of analyzing each data patch separately, it is highly desi…

    Submitted 19 June, 2024; originally announced June 2024.

  18. arXiv:2406.13538  [pdf, other]

    physics.optics physics.ins-det

    Farey tree locking of terahertz semiconductor laser frequency combs

    Authors: Guibin Liu, Xuhong Ma, Kang Zhou, Binbin Liu, Lulu Zheng, Xianglong Bi, Shumin Wu, Yanming Lu, Ziping Li, Wenjian Wan, Zhenzhen Zhang, Junsong Peng, Ya Zhang, Heping Zeng, Hua Li

    Abstract: Frequency combs show various applications in molecular fingerprinting, imaging, communications, and so on. In the terahertz frequency range, semiconductor-based quantum cascade lasers (QCLs) are ideal platforms for realizing the frequency comb operation. Although self-started frequency comb operation can be obtained in free-running terahertz QCLs due to the four-wave mixing locking effects, resona…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 22 page, 7 figures

  19. arXiv:2406.11163  [pdf, other]

    eess.SP

    Explainable Bayesian Recurrent Neural Smoother to Capture Global State Evolutionary Correlations

    Authors: Shi Yan, Yan Liang, Huayu Zhang, Le Zheng, Difan Zou, Binglu Wang

    Abstract: Through integrating the evolutionary correlations across global states in the bidirectional recursion, an explainable Bayesian recurrent neural smoother (EBRNS) is proposed for offline data-assisted fixed-interval state smoothing. At first, the proposed model, containing global states in the evolutionary interval, is transformed into an equivalent model with bidirectional memory. This transformati…

    Submitted 16 June, 2024; originally announced June 2024.

  20. arXiv:2406.09908  [pdf, other]

    cs.LG cs.CV

    What Does Softmax Probability Tell Us about Classifiers Ranking Across Diverse Test Conditions?

    Authors: Weijie Tu, Weijian Deng, Liang Zheng, Tom Gedeon

    Abstract: This work aims to develop a measure that can accurately rank the performance of various classifiers when they are tested on unlabeled data from out-of-distribution (OOD) distributions. We commence by demonstrating that conventional uncertainty metrics, notably the maximum Softmax prediction probability, possess inherent utility in forecasting model generalization across certain OOD contexts. Build…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: TMLR 2024 (https://openreview.net/forum?id=vtiDUgGjyx)

  21. Optimal Kernel Orchestration for Tensor Programs with Korch

    Authors: Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai

    Abstract: Kernel orchestration is the task of mapping the computation defined in different operators of a deep neural network (DNN) to the execution of GPU kernels on modern hardware platforms. Prior approaches optimize kernel orchestration by greedily applying operator fusion, which fuses the computation of multiple operators into a single kernel, and miss a variety of optimization opportunities in kernel…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Fix some typos in the ASPLOS version

    Journal ref: Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems 3 (2024) 755-769

  22. arXiv:2406.09257  [pdf, other]

    cs.LG cs.CV

    Assessing Model Generalization in Vicinity

    Authors: Yuchi Liu, Yifan Sun, Jingdong Wang, Liang Zheng

    Abstract: This paper evaluates the generalization ability of classification models on out-of-distribution test sets without depending on ground truth labels. Common approaches often calculate an unsupervised metric related to a specific model property, like confidence or invariance, which correlates with out-of-distribution accuracy. However, these metrics are typically computed for each test sample individ…

    Submitted 13 June, 2024; originally announced June 2024.

  23. arXiv:2406.09187  [pdf, other]

    cs.LG

    GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning

    Authors: Zhen Xiang, Linzhi Zheng, Yanjie Li, Junyuan Hong, Qinbin Li, Han Xie, Jiawei Zhang, Zidi Xiong, Chulin Xie, Carl Yang, Dawn Song, Bo Li

    Abstract: The rapid advancement of large language models (LLMs) has catalyzed the deployment of LLM-powered agents across numerous applications, raising new concerns regarding their safety and trustworthiness. Existing methods for enhancing the safety of LLMs are not directly transferable to LLM-powered agents due to their diverse objectives and output modalities. In this paper, we propose GuardAgent, the f…

    Submitted 13 June, 2024; originally announced June 2024.

  24. arXiv:2406.06977  [pdf, other]

    cs.LG cs.DB

    Cross-domain-aware Worker Selection with Training for Crowdsourced Annotation

    Authors: Yushi Sun, Jiachuan Wang, Peng Cheng, Libin Zheng, Lei Chen, Jian Yin

    Abstract: Annotation through crowdsourcing draws incremental attention, which relies on an effective selection scheme given a pool of workers. Existing methods propose to select workers based on their performance on tasks with ground truth, while two important points are missed. 1) The historical performances of workers in other tasks. In real-world scenarios, workers need to solve a new task whose correlat…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by ICDE 2024

  25. arXiv:2406.06776  [pdf, other]

    cs.CV cs.LG

    SeeFar: Satellite Agnostic Multi-Resolution Dataset for Geospatial Foundation Models

    Authors: James Lowman, Kelly Liu Zheng, Roydon Fraser, Jesse Van Griensven The, Mojtaba Valipour

    Abstract: SeeFar is an evolving collection of multi-resolution satellite images from public and commercial satellites. We specifically curated this dataset for training geospatial foundation models, unconstrained by satellite type. In recent years, advances in technology have made satellite imagery more accessible than ever. More earth-observing satellites have been launched in the last five years than in t…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Work in Progress!

  26. arXiv:2406.06475  [pdf, other]

    cs.IR cs.AI

    Survey for Landing Generative AI in Social and E-commerce Recsys -- the Industry Perspectives

    Authors: Da Xu, Danqing Zhang, Guangyu Yang, Bo Yang, Shuyuan Xu, Lingling Zheng, Cindy Liang

    Abstract: Recently, generative AI (GAI), with their emerging capabilities, have presented unique opportunities for augmenting and revolutionizing industrial recommender systems (Recsys). Despite growing research efforts at the intersection of these fields, the integration of GAI into industrial Recsys remains in its infancy, largely due to the intricate nature of modern industrial Recsys infrastructure, ope…

    Submitted 10 June, 2024; originally announced June 2024.

  27. arXiv:2406.05375  [pdf, other]

    cs.AI cs.LG

    LEMMA-RCA: A Large Multi-modal Multi-domain Dataset for Root Cause Analysis

    Authors: Lecheng Zheng, Zhengzhang Chen, Dongjie Wang, Chengyuan Deng, Reon Matsuoka, Haifeng Chen

    Abstract: Root cause analysis (RCA) is crucial for enhancing the reliability and performance of complex systems. However, progress in this field has been hindered by the lack of large-scale, open-source datasets tailored for RCA. To bridge this gap, we introduce LEMMA-RCA, a large dataset designed for diverse RCA tasks across multiple domains and modalities. LEMMA-RCA features various real-world fault scena…

    Submitted 8 June, 2024; originally announced June 2024.

  28. arXiv:2406.04314  [pdf, other]

    cs.CV

    Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step

    Authors: Zhanhao Liang, Yuhui Yuan, Shuyang Gu, Bohan Chen, Tiankai Hang, Ji Li, Liang Zheng

    Abstract: Recently, Direct Preference Optimization (DPO) has extended its success from aligning large language models (LLMs) to aligning text-to-image diffusion models with human preferences. Unlike most existing DPO methods that assume all diffusion steps share a consistent preference order with the final generated images, we argue that this assumption neglects step-specific denoising performance and that…

    Submitted 6 June, 2024; originally announced June 2024.

  29. arXiv:2406.01431  [pdf, other]

    cs.RO

    Deep Stochastic Kinematic Models for Probabilistic Motion Forecasting in Traffic

    Authors: Laura Zheng, Sanghyun Son, Jing Liang, Xijun Wang, Brian Clipp, Ming C. Lin

    Abstract: Kinematic priors have shown to be helpful in boosting generalization and performance in prior work on trajectory forecasting. Specifically, kinematic priors have been applied such that models predict a set of actions instead of future output trajectories. By unrolling predicted trajectories via time integration and models of kinematic dynamics, predicted trajectories are not only kinematically fea…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 8 pages

  30. arXiv:2406.01425  [pdf, other]

    cs.CV

    Sensitivity-Informed Augmentation for Robust Segmentation

    Authors: Laura Zheng, Wenjie Wei, Tony Wu, Jacob Clements, Shreelekha Revankar, Andre Harrison, Yu Shen, Ming C. Lin

    Abstract: Segmentation is an integral module in many visual computing applications such as virtual try-on, medical imaging, autonomous driving, and agricultural automation. These applications often involve either widespread consumer use or highly variable environments, both of which can degrade the quality of visual sensor data, whether from a common mobile phone or an expensive satellite imaging camera. In…

    Submitted 16 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages

  31. arXiv:2406.00421  [pdf]

    eess.SY

    Modal Analysis of Power System with High CIG Penetration Based on Impedance Models

    Authors: Le Zheng, Jiajie Zheng, Jiajian Lin, Chongru Liu

    Abstract: This paper explores the modal analysis of power systems with high Converter-Interfaced Generation (CIG) penetration utilizing an impedance-based modeling approach. Traditional modal analysis based on the state-space model (MASS) requires comprehensive control structures and parameters of each system element, a challenging prerequisite as converters increasingly integrate into power systems and the…

    Submitted 1 June, 2024; originally announced June 2024.

  32. arXiv:2405.20252  [pdf, other]

    cs.CL

    Towards Hierarchical Multi-Agent Workflows for Zero-Shot Prompt Optimization

    Authors: Yuchi Liu, Jaskirat Singh, Gaowen Liu, Ali Payani, Liang Zheng

    Abstract: Large language models (LLMs) have shown great progress in responding to user questions, allowing for a multitude of diverse applications. Yet, the quality of LLM outputs heavily depends on the prompt design, where a good prompt might enable the LLM to answer a very challenging question correctly. Therefore, recent works have developed many strategies for improving the prompt, including both manual…

    Submitted 30 May, 2024; originally announced May 2024.

  33. arXiv:2405.15013  [pdf, other]

    cs.LG

    Make Inference Faster: Efficient GPU Memory Management for Butterfly Sparse Matrix Multiplication

    Authors: Antoine Gonon, Léon Zheng, Pascal Carrivain, Quoc-Tung Le

    Abstract: This paper is the first to assess the state of existing sparse matrix multiplication algorithms on GPU for the butterfly structure, a promising form of sparsity. This is achieved through a comprehensive benchmark that can be easily modified to add a new implementation. The goal is to provide a simple tool for users to select the optimal implementation based on their settings. Using this benchmark,…

    Submitted 23 May, 2024; originally announced May 2024.

  34. arXiv:2405.14359  [pdf, other]

    cs.IR

    Look into the Future: Deep Contextualized Sequential Recommendation

    Authors: Lei Zheng, Ning Li, Yanhuan Huang, Ruiwen Xu, Weinan Zhang, Yong Yu

    Abstract: Sequential recommendation focuses on mining useful patterns from the user behavior history to better estimate his preference on the candidate items. Previous solutions adopt recurrent networks or retrieval methods to obtain the user's profile representation so as to perform the preference estimation. In this paper, we propose a novel framework of sequential recommendation called Look into the Futu…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2404.18304 by other authors

  35. arXiv:2405.13548  [pdf, other]

    cs.SE cs.CL

    ECLIPSE: Semantic Entropy-LCS for Cross-Lingual Industrial Log Parsing

    Authors: Wei Zhang, Xianfu Cheng, Yi Zhang, Jian Yang, Hongcheng Guo, Zhoujun Li, Xiaolin Yin, Xiangyuan Guan, Xu Shi, Liangfan Zheng, Bo Zhang

    Abstract: Log parsing, a vital task for interpreting the vast and complex data produced within software architectures faces significant challenges in the transition from academic benchmarks to the industrial domain. Existing log parsers, while highly effective on standardized public datasets, struggle to maintain performance and efficiency when confronted with the sheer scale and diversity of real-world ind…

    Submitted 24 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  36. arXiv:2405.12503  [pdf, other]

    cs.CV

    CLRKDNet: Speeding up Lane Detection with Knowledge Distillation

    Authors: Weiqing Qi, Guoyang Zhao, Fulong Ma, Linwei Zheng, Ming Liu

    Abstract: Road lanes are integral components of the visual perception systems in intelligent vehicles, playing a pivotal role in safe navigation. In lane detection tasks, balancing accuracy with real-time performance is essential, yet existing methods often sacrifice one for the other. To address this trade-off, we introduce CLRKDNet, a streamlined model that balances detection accuracy with real-time perfo…

    Submitted 21 May, 2024; originally announced May 2024.

  37. arXiv:2405.12107  [pdf, other]

    cs.CV cs.CL

    Imp: Highly Capable Large Multimodal Models for Mobile Devices

    Authors: Zhenwei Shao, Zhou Yu, Jun Yu, Xuecheng Ouyang, Lihao Zheng, Zhenbiao Gai, Mingyang Wang, Jiajun Ding

    Abstract: By harnessing the capabilities of large language models (LLMs), recent large multimodal models (LMMs) have shown remarkable versatility in open-world multimodal understanding. Nevertheless, they are usually parameter-heavy and computation-intensive, thus hindering their applicability in resource-constrained scenarios. To this end, several lightweight LMMs have been proposed successively to maximiz…

    Submitted 29 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: fix some typos and correct a few number in the tables

  38. arXiv:2405.09523  [pdf, ps, other]

    math.ST cs.IT

    On Semi-supervised Estimation of Discrete Distributions under f-divergences

    Authors: Hasan Sabri Melihcan Erol, Lizhong Zheng

    Abstract: We study the problem of estimating the joint probability mass function (pmf) over two random variables. In particular, the estimation is based on the observation of $m$ samples containing both variables and $n$ samples missing one fixed variable. We adopt the minimax framework with $l^p_p$ loss functions. Recent work established that univariate minimax estimator combinations achieve minimax risk w…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: Full version. Presented in ISIT-24. arXiv admin note: text overlap with arXiv:2305.07955

  39. arXiv:2405.07029  [pdf]

    cs.SD eess.AS

    A framework of text-dependent speaker verification for chinese numerical string corpus

    Authors: Litong Zheng, Feng Hong, Weijie Xu, Wan Zheng

    Abstract: The Chinese numerical string corpus, serves as a valuable resource for speaker verification, particularly in financial transactions. Researches indicate that in short speech scenarios, text-dependent speaker verification (TD-SV) consistently outperforms text-independent speaker verification (TI-SV). However, TD-SV potentially includes the validation of text information, that can be negatively impa…

    Submitted 21 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2312.01645

  40. arXiv:2404.18829  [pdf, other]

    nucl-th hep-ph nucl-ex

    Disentangling the development of collective flow in high energy proton proton collisions with a multiphase transport model

    Authors: Liang Zheng, Lian Liu, Zi-Wei Lin, Qi-Ye Shou, Zhong-Bao Yin

    Abstract: In this work, we investigate the collective flow development in high energy proton proton (pp) collisions with a multiphase transport model (AMPT) based on PYTHIA8 initial conditions with a sub-nucleon structure. It is found that the PYTHIA8 based AMPT model can reasonably describe both the charged hadron productions and elliptic flow experimental data measured in pp collisions at $\sqrt{s}=13$ Te…

    Submitted 29 April, 2024; originally announced April 2024.

  41. arXiv:2404.15678  [pdf, other]

    cs.IR cs.AI

    Retrieval and Distill: A Temporal Data Shift-Free Paradigm for Online Recommendation System

    Authors: Lei Zheng, Ning Li, Weinan Zhang, Yong Yu

    Abstract: Current recommendation systems are significantly affected by a serious issue of temporal data shift, which is the inconsistency between the distribution of historical data and that of online data. Most existing models focus on utilizing updated data, overlooking the transferable, temporal data shift-free information that can be learned from shifting data. We propose the Temporal Invariance of Asso…

    Submitted 13 June, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

  42. arXiv:2404.14850  [pdf, other]

    cs.CL cs.LG q-bio.BM

    Simple, Efficient and Scalable Structure-aware Adapter Boosts Protein Language Models

    Authors: Yang Tan, Mingchen Li, Bingxin Zhou, Bozitao Zhong, Lirong Zheng, Pan Tan, Ziyi Zhou, Huiqun Yu, Guisheng Fan, Liang Hong

    Abstract: Fine-tuning Pre-trained protein language models (PLMs) has emerged as a prominent strategy for enhancing downstream prediction tasks, often outperforming traditional supervised learning approaches. As a widely applied powerful technique in natural language processing, employing Parameter-Efficient Fine-Tuning techniques could potentially enhance the performance of PLMs. However, the direct transfe…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 30 pages, 4 figures, 8 tables

  43. arXiv:2404.13016  [pdf, other]

    cs.CV cs.LG stat.ML

    Optimizing Calibration by Gaining Aware of Prediction Correctness

    Authors: Yuchi Liu, Lei Wang, Yuli Zou, James Zou, Liang Zheng

    Abstract: Model calibration aims to align confidence with prediction correctness. The Cross-Entropy (CE) loss is widely used for calibrator training, which forces the model to increase confidence on the ground truth class. However, we find the CE loss has intrinsic limitations. For example, for a narrow misclassification, a calibrator trained by the CE loss often produces high confidence on the wrongly pr…

    Submitted 24 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  44. arXiv:2404.12135  [pdf, other]

    cs.MA cs.CR cs.DC

    mABC: multi-Agent Blockchain-Inspired Collaboration for root cause analysis in micro-services architecture

    Authors: Wei Zhang, Hongcheng Guo, Jian Yang, Yi Zhang, Chaoran Yan, Zhoujin Tian, Hangyuan Ji, Zhoujun Li, Tongliang Li, Tieqiao Zheng, Chao Chen, Yi Liang, Xu Shi, Liangfan Zheng, Bo Zhang

    Abstract: The escalating complexity of micro-services architecture in cloud-native technologies poses significant challenges for maintaining system stability and efficiency. To conduct root cause analysis (RCA) and resolution of alert events, we propose a pioneering framework, multi-Agent Blockchain-inspired Collaboration for root cause analysis in micro-services architecture (mABC), to revolutionize the AI…

    Submitted 3 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  45. arXiv:2404.11943  [pdf, other]

    cs.HC

    AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

    Authors: Bo Pan, Jiaying Lu, Ke Wang, Li Zheng, Zhen Wen, Yingchaojie Feng, Minfeng Zhu, Wei Chen

    Abstract: The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with…

    Submitted 18 April, 2024; originally announced April 2024.

  46. arXiv:2404.11139  [pdf, other]

    cs.CV

    GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement

    Authors: Linfang Zheng, Tze Ho Elden Tse, Chen Wang, Yinghan Sun, Hua Chen, Ales Leonardis, Wei Zhang

    Abstract: Object pose refinement is essential for robust object pose estimation. Previous work has made significant progress towards instance-level object pose refinement. Yet, category-level pose refinement is a more challenging problem due to large shape variations within a category and the discrepancies between the target object and the shape prior. To address these challenges, we introduce a novel archi…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: The IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024

  47. arXiv:2404.09432  [pdf, other]

    cs.CV cs.AI cs.LG

    The 8th AI City Challenge

    Authors: Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Pranamesh Chakraborty, Sanjita Prajapati, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Fady Alnajjar, Ganzorig Batnasan, Ping-Yang Chen, Jun-Wei Hsieh, Xunlei Wu, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Rama Chellappa

    Abstract: The eighth AI City Challenge highlighted the convergence of computer vision and artificial intelligence in areas like retail, warehouse settings, and Intelligent Traffic Systems (ITS), presenting significant research opportunities. The 2024 edition featured five tracks, attracting unprecedented interest from 726 teams in 47 countries and regions. Track 1 dealt with multi-target multi-camera (MTMC)…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Summary of the 8th AI City Challenge Workshop in conjunction with CVPR 2024

  48. arXiv:2404.06860  [pdf, other]

    cs.CV

    Monocular 3D lane detection for Autonomous Driving: Recent Achievements, Challenges, and Outlooks

    Authors: Fulong Ma, Weiqing Qi, Guoyang Zhao, Linwei Zheng, Sheng Wang, Yuxuan Liu, Ming Liu

    Abstract: 3D lane detection is essential in autonomous driving as it extracts structural and traffic information from the road in three-dimensional space, aiding self-driving cars in logical, safe, and comfortable path planning and motion control. Given the cost of sensors and the advantages of visual data in color information, 3D lane detection based on monocular vision is an important research direction i…

    Submitted 19 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  49. arXiv:2404.04557  [pdf, other]

    cs.CV

    Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes

    Authors: Zhiyuan Yu, Zheng Qin, Lintao Zheng, Kai Xu

    Abstract: Multi-instance point cloud registration estimates the poses of multiple instances of a model point cloud in a scene point cloud. Extracting accurate point correspondences is central to the problem. Existing approaches usually treat the scene point cloud as a whole, overlooking the separation of instances. Therefore, point features could be easily polluted by other points from the background o…

    Submitted 6 April, 2024; originally announced April 2024.

  50. arXiv:2404.02127  [pdf, other]

    cs.CL cs.AI cs.LG

    FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

    Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

    Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, many legal tasks remain out of reach for most open LLMs, and no large-scale instruction datasets yet exist for the domain. This critically limits research in this application area. In this work, we curate LawInstruct, a large legal instruction dataset, covering 17 jurisdictio…

    Submitted 2 April, 2024; originally announced April 2024.

    MSC Class: 68T50; ACM Class: I.2