
Showing 1–50 of 473 results for author: Shen, Z

Searching in archive cs.
  1. arXiv:2406.18962  [pdf, other]

    cs.IR

    Multi-modal Food Recommendation using Clustering and Self-supervised Learning

    Authors: Yixin Zhang, Xin Zhou, Qianwen Meng, Fanglin Zhu, Yonghui Xu, Zhiqi Shen, Lizhen Cui

    Abstract: Food recommendation systems serve as pivotal components in the realm of digital lifestyle services, designed to assist users in discovering recipes and food items that resonate with their unique dietary predilections. Typically, multi-modal descriptions offer an exhaustive profile for each recipe, thereby ensuring recommendations that are both personalized and accurate. Our preliminary investigati…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Working paper

  2. arXiv:2406.16137  [pdf, other]

    cs.CV

    MLPHand: Real Time Multi-View 3D Hand Mesh Reconstruction via MLP Modeling

    Authors: Jian Yang, Jiakun Li, Guoming Li, Zhen Shen, Huai-Yu Wu, Zhaoxin Fan, Heng Huang

    Abstract: Multi-view hand mesh reconstruction is a critical task for applications in virtual reality and human-computer interaction, but it remains a formidable challenge. Although existing multi-view hand reconstruction methods achieve remarkable accuracy, they typically come with an intensive computational burden that hinders real-time inference. To this end, we propose MLPHand, a novel method designed fo…

    Submitted 23 June, 2024; originally announced June 2024.

  3. arXiv:2406.15786  [pdf, other]

    cs.LG cs.AI cs.CL

    What Matters in Transformers? Not All Attention is Needed

    Authors: Shwai He, Guoheng Sun, Zheyu Shen, Ang Li

    Abstract: Scaling Transformer-based large language models (LLMs) has demonstrated promising performance across various tasks. However, this scaling also introduces redundant structures, posing challenges for real-world deployment. Despite some recognition of redundancy in LLMs, the variability of redundancy across different structures, such as MLP and Attention layers, is under-explored. In this work, we in…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 15 pages, 13 figures, 6 tables

  4. arXiv:2406.14318  [pdf, other]

    cs.CR cs.AI cs.CL

    The Fire Thief Is Also the Keeper: Balancing Usability and Privacy in Prompts

    Authors: Zhili Shen, Zihang Xi, Ying He, Wei Tong, Jingyu Hua, Sheng Zhong

    Abstract: The rapid adoption of online chatbots represents a significant advancement in artificial intelligence. However, this convenience brings considerable privacy concerns, as prompts can inadvertently contain sensitive information exposed to large language models (LLMs). Limited by high computational costs, reduced task usability, and excessive system modifications, previous works based on local deploy…

    Submitted 20 June, 2024; originally announced June 2024.

  5. arXiv:2406.13225  [pdf, other]

    cs.LG cs.AI cs.IR

    Communication-Efficient Federated Knowledge Graph Embedding with Entity-Wise Top-K Sparsification

    Authors: Xiaoxiong Zhang, Zhiwei Zeng, Xin Zhou, Dusit Niyato, Zhiqi Shen

    Abstract: Federated Knowledge Graphs Embedding learning (FKGE) encounters challenges in communication efficiency stemming from the considerable size of parameters and extensive communication rounds. However, existing FKGE methods only focus on reducing communication rounds by conducting multiple rounds of local training in each communication round, and ignore reducing the size of parameters transmitted with…

    Submitted 19 June, 2024; originally announced June 2024.

  6. arXiv:2406.11943  [pdf, other]

    cs.IR cs.AI

    Personalized Federated Knowledge Graph Embedding with Client-Wise Relation Graph

    Authors: Xiaoxiong Zhang, Zhiwei Zeng, Xin Zhou, Dusit Niyato, Zhiqi Shen

    Abstract: Federated Knowledge Graph Embedding (FKGE) has recently garnered considerable interest due to its capacity to extract expressive representations from distributed knowledge graphs, while concurrently safeguarding the privacy of individual clients. Existing FKGE methods typically harness the arithmetic mean of entity embeddings from all clients as the global supplementary knowledge, and learn a repl…

    Submitted 17 June, 2024; originally announced June 2024.

  7. arXiv:2406.11418  [pdf, other]

    cs.CL

    BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM

    Authors: Zhewen Shen, Aditya Joshi, Ruey-Cheng Chen

    Abstract: Children from bilingual backgrounds benefit from interactions with parents and teachers to re-acquire their heritage language. In this paper, we investigate how this insight from behavioral study can be incorporated into the learning of small-scale language models. We introduce BAMBINO-LM, a continual pretraining strategy for BabyLM that uses a novel combination of alternation and PPO-based perple…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Short paper; Under review

  8. arXiv:2406.09133  [pdf]

    cs.CL

    RH-SQL: Refined Schema and Hardness Prompt for Text-to-SQL

    Authors: Jiawen Yi, Guo Chen, Zixiang Shen

    Abstract: Text-to-SQL is a technology that converts natural language queries into the structured query language SQL. A novel research approach that has recently gained attention focuses on methods based on the complexity of SQL queries, achieving notable performance improvements. However, existing methods entail significant storage and training costs, which hampers their practical application. To address th…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 4 pages, 2 figures, 2024 6th International Conference on Electronic Engineering and Informatics (EEI 2024)

  9. arXiv:2406.09084  [pdf, other]

    stat.ML cs.LG

    Operator-informed score matching for Markov diffusion models

    Authors: Zheyang Shen, Chris J. Oates

    Abstract: Diffusion models are typically trained using score matching, yet score matching is agnostic to the particular forward process that defines the model. This paper argues that Markov diffusion models enjoy an advantage over other types of diffusion model, as their associated operators can be exploited to improve the training process. In particular, (i) there exists an explicit formal solution to the…

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Preprint; 19 pages, 5 figures

  10. arXiv:2406.07835  [pdf, other]

    cs.CL cs.AI

    SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature

    Authors: David Wadden, Kejian Shi, Jacob Morrison, Aakanksha Naik, Shruti Singh, Nitzan Barzilay, Kyle Lo, Tom Hope, Luca Soldaini, Shannon Zejiang Shen, Doug Downey, Hannaneh Hajishirzi, Arman Cohan

    Abstract: We present SciRIFF (Scientific Resource for Instruction-Following and Finetuning), a dataset of 137K instruction-following demonstrations for 54 tasks covering five essential scientific literature understanding capabilities: information extraction, summarization, question answering, claim verification, and classification. SciRIFF demonstrations are notable for their long input contexts, detailed t…

    Submitted 18 June, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: Submitted to NeurIPS Datasets and Benchmarks 2024

  11. arXiv:2406.07545  [pdf, other]

    cs.CL cs.AI

    Open-LLM-Leaderboard: From Multi-choice to Open-style Questions for LLMs Evaluation, Benchmark, and Arena

    Authors: Aidar Myrzakhan, Sondos Mahmoud Bsharat, Zhiqiang Shen

    Abstract: Multiple-choice questions (MCQ) are frequently used to assess large language models (LLMs). Typically, an LLM is given a question and selects the answer deemed most probable after adjustments for factors like length. Unfortunately, LLMs may inherently favor certain answer choice IDs, such as A/B/C/D, due to inherent biases of priori unbalanced probabilities, influencing the prediction of answers b…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Code and dataset are available at https://github.com/VILA-Lab/Open-LLM-Leaderboard

  12. arXiv:2406.04689  [pdf, other]

    cs.CV

    CDeFuse: Continuous Decomposition for Infrared and Visible Image Fusion

    Authors: Haolong Ma, Hui Li, Chunyang Cheng, Xiaoning Song, Zhongwei Shen

    Abstract: As a common image processing technique, image decomposition is often used to extract complementary information between modalities. In current decomposition-based image fusion methods, typically, source images are decomposed into three parts at single scale (i.e., visible-exclusive part, infrared-exclusive part, and common part) and lacking interaction between modalities during the decomposition pr…

    Submitted 7 June, 2024; originally announced June 2024.

  13. arXiv:2406.03933  [pdf, other]

    cs.CR cs.IR

    Beyond Similarity: Personalized Federated Recommendation with Composite Aggregation

    Authors: Honglei Zhang, Haoxuan Li, Jundong Chen, Sen Cui, Kunda Yan, Abudukelimu Wuerkaixi, Xin Zhou, Zhiqi Shen, Yidong Li

    Abstract: Federated recommendation aims to collect global knowledge by aggregating local models from massive devices, to provide recommendations while ensuring privacy. Current methods mainly leverage aggregation functions invented by federated vision community to aggregate parameters from similar clients, e.g., clustering aggregation. Despite considerable performance, we argue that it is suboptimal to appl…

    Submitted 6 June, 2024; originally announced June 2024.

  14. arXiv:2406.02618  [pdf, other]

    q-bio.QM cs.AI eess.IV

    Immunocto: a massive immune cell database auto-generated for histopathology

    Authors: Mikaël Simard, Zhuoyan Shen, Maria A. Hawkins, Charles-Antoine Collins-Fekete

    Abstract: With the advent of novel cancer treatment options such as immunotherapy, studying the tumour immune micro-environment is crucial to inform on prognosis and understand response to therapeutic agents. A key approach to characterising the tumour immune micro-environment may be through combining (1) digitised microscopic high-resolution optical images of hematoxylin and eosin (H&E) stained tissue sect…

    Submitted 3 June, 2024; originally announced June 2024.

  15. arXiv:2405.20764  [pdf, other]

    cs.CV

    CoMoFusion: Fast and High-quality Fusion of Infrared and Visible Image with Consistency Model

    Authors: Zhiming Meng, Hui Li, Zeyang Zhang, Zhongwei Shen, Yunlong Yu, Xiaoning Song, Xiaojun Wu

    Abstract: Generative models are widely utilized to model the distribution of fused images in the field of infrared and visible image fusion. However, current generative models based fusion methods often suffer from unstable training and slow inference speed. To tackle this problem, a novel fusion method based on consistency model is proposed, termed as CoMoFusion, which can generate the high-quality images…

    Submitted 11 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  16. arXiv:2405.18373  [pdf, other]

    stat.ML cs.LG math.OC

    A Hessian-Aware Stochastic Differential Equation for Modelling SGD

    Authors: Xiang Li, Zebang Shen, Liang Zhang, Niao He

    Abstract: Continuous-time approximation of Stochastic Gradient Descent (SGD) is a crucial tool to study its escaping behaviors from stationary points. However, existing stochastic differential equation (SDE) models fail to fully capture these behaviors, even for simple quadratic objectives. Built on a novel stochastic backward error analysis framework, we derive the Hessian-Aware Stochastic Modified Equatio…

    Submitted 28 May, 2024; originally announced May 2024.

  17. arXiv:2405.16395  [pdf, other]

    cs.LG

    Daily Physical Activity Monitoring -- Adaptive Learning from Multi-source Motion Sensor Data

    Authors: Haoting Zhang, Donglin Zhan, Yunduan Lin, Jinghai He, Qing Zhu, Zuo-Jun Max Shen, Zeyu Zheng

    Abstract: In healthcare applications, there is a growing need to develop machine learning models that use data from a single source, such as that from a wrist wearable device, to monitor physical activities, assess health risks, and provide immediate health recommendations or interventions. However, the limitation of using single-source data often compromises the model's accuracy, as it fails to capture the…

    Submitted 25 May, 2024; originally announced May 2024.

  18. arXiv:2405.15843  [pdf, other]

    cs.CV cs.AI

    SpotNet: An Image Centric, Lidar Anchored Approach To Long Range Perception

    Authors: Louis Foucard, Samar Khanna, Yi Shi, Chi-Kuei Liu, Quinn Z Shen, Thuyen Ngo, Zi-Xiang Xia

    Abstract: In this paper, we propose SpotNet: a fast, single stage, image-centric but LiDAR anchored approach for long range 3D object detection. We demonstrate that our approach to LiDAR/image sensor fusion, combined with the joint learning of 2D and 3D detection tasks, can lead to accurate 3D object detection with very sparse LiDAR support. Unlike more recent bird's-eye-view (BEV) sensor-fusion methods whi…

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.14778  [pdf, ps, other]

    stat.ML cs.LG

    Optimal Rates for Vector-Valued Spectral Regularization Learning Algorithms

    Authors: Dimitri Meunier, Zikai Shen, Mattes Mollenhauer, Arthur Gretton, Zhu Li

    Abstract: We study theoretical properties of a broad class of regularized algorithms with vector-valued output. These spectral algorithms include kernel ridge regression, kernel principal component regression, various implementations of gradient descent and many more. Our contributions are twofold. First, we rigorously confirm the so-called saturation effect for ridge regression with vector-valued output by…

    Submitted 23 May, 2024; originally announced May 2024.

  20. arXiv:2405.12786  [pdf, other]

    cs.CR

    Rethinking the Vulnerabilities of Face Recognition Systems:From a Practical Perspective

    Authors: Jiahao Chen, Zhiqiang Shen, Yuwen Pu, Chunyi Zhou, Changjiang Li, Jiliang Li, Ting Wang, Shouling Ji

    Abstract: Face Recognition Systems (FRS) have increasingly integrated into critical applications, including surveillance and user authentication, highlighting their pivotal role in modern security systems. Recent studies have revealed vulnerabilities in FRS to adversarial (e.g., adversarial patch attacks) and backdoor attacks (e.g., training data poisoning), raising significant concerns about their reliabil…

    Submitted 8 June, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 19 pages,version 3

  21. arXiv:2405.11671  [pdf, other]

    cs.DS

    BYO: A Unified Framework for Benchmarking Large-Scale Graph Containers

    Authors: Brian Wheatman, Xiaojun Dong, Zheqi Shen, Laxman Dhulipala, Jakub Łącki, Prashant Pandey, Helen Xu

    Abstract: A fundamental building block in any graph algorithm is a graph container - a data structure used to represent the graph. Ideally, a graph container enables efficient access to the underlying graph, has low space usage, and supports updating the graph efficiently. In this paper, we conduct an extensive empirical evaluation of graph containers designed to support running algorithms on large graphs.…

    Submitted 19 May, 2024; originally announced May 2024.

  22. arXiv:2405.10879  [pdf, other]

    cs.CV

    One registration is worth two segmentations

    Authors: Shiqi Huang, Tingfa Xu, Ziyi Shen, Shaheer Ullah Saeed, Wen Yan, Dean Barratt, Yipeng Hu

    Abstract: The goal of image registration is to establish spatial correspondence between two or more images, traditionally through dense displacement fields (DDFs) or parametric transformations (e.g., rigid, affine, and splines). Rethinking the existing paradigms of achieving alignment via spatial transformations, we uncover an alternative but more intuitive correspondence representation: a set of correspond…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Early Accepted by MICCAI2024

  23. arXiv:2405.09777  [pdf, other]

    cs.CV

    Rethinking Barely-Supervised Segmentation from an Unsupervised Domain Adaptation Perspective

    Authors: Zhiqiang Shen, Peng Cao, Junming Su, Jinzhu Yang, Osmar R. Zaiane

    Abstract: This paper investigates an extremely challenging problem, barely-supervised medical image segmentation (BSS), where the training dataset comprises limited labeled data with only single-slice annotations and numerous unlabeled images. Currently, state-of-the-art (SOTA) BSS methods utilize a registration-based paradigm, depending on image registration to propagate single-slice annotations into volum…

    Submitted 15 May, 2024; originally announced May 2024.

  24. arXiv:2405.09463  [pdf, other]

    cs.CV

    Gaze-DETR: Using Expert Gaze to Reduce False Positives in Vulvovaginal Candidiasis Screening

    Authors: Yan Kong, Sheng Wang, Jiangdong Cai, Zihao Zhao, Zhenrong Shen, Yonghao Li, Manman Fei, Qian Wang

    Abstract: Accurate detection of vulvovaginal candidiasis is critical for women's health, yet its sparse distribution and visually ambiguous characteristics pose significant challenges for accurate identification by pathologists and neural networks alike. Our eye-tracking data reveals that areas garnering sustained attention - yet not marked by experts after deliberation - are often aligned with false positi…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: MICCAI-2024 early accept. Our code is available at https://github.com/YanKong0408/Gaze-DETR

  25. arXiv:2405.09113  [pdf, ps, other]

    cs.LG

    Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization

    Authors: Kai Hu, Weichen Yu, Tianjun Yao, Xiang Li, Wenhe Liu, Lijun Yu, Yining Li, Kai Chen, Zhiqiang Shen, Matt Fredrikson

    Abstract: Recent research indicates that large language models (LLMs) are susceptible to jailbreaking attacks that can generate harmful content. This paper introduces a novel token-level attack method, Adaptive Dense-to-Sparse Constrained Optimization (ADC), which effectively jailbreaks several open-source LLMs. Our approach relaxes the discrete jailbreak optimization into a continuous optimization and prog…

    Submitted 15 May, 2024; originally announced May 2024.

  26. arXiv:2405.07784  [pdf, other]

    cs.CV

    Generating Human Motion in 3D Scenes from Text Descriptions

    Authors: Zhi Cen, Huaijin Pi, Sida Peng, Zehong Shen, Minghui Yang, Shuai Zhu, Hujun Bao, Xiaowei Zhou

    Abstract: Generating human motions from textual descriptions has gained growing research interest due to its wide range of applications. However, only a few works consider human-scene interactions together with text conditions, which is crucial for visual and physical realism. This paper focuses on the task of generating human motions in 3D indoor scenes given text descriptions of the human-scene interactio…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: Project page: https://zju3dv.github.io/text_scene_motion

  27. arXiv:2405.07552  [pdf, other]

    stat.ML cs.LG stat.ME

    Distributed High-Dimensional Quantile Regression: Estimation Efficiency and Support Recovery

    Authors: Caixing Wang, Ziliang Shen

    Abstract: In this paper, we focus on distributed estimation and support recovery for high-dimensional linear quantile regression. Quantile regression is a popular alternative tool to the least squares regression for robustness against outliers and data heterogeneity. However, the non-smoothness of the check loss function poses big challenges to both computation and theory in the distributed setting. To tack…

    Submitted 1 June, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: Forty-first International Conference on Machine Learning (ICML 2024), 27 pages, 4 figures, 14 tables

  28. arXiv:2405.07237  [pdf, other]

    cs.RO

    Soft Contact Simulation and Manipulation Learning of Deformable Objects with Vision-based Tactile Sensor

    Authors: Jianhua Shan, Yuhao Sun, Shixin Zhang, Fuchun Sun, Zixi Chen, Zirong Shen, Cesare Stefanini, Yiyong Yang, Shan Luo, Bin Fang

    Abstract: Deformable object manipulation is a classical and challenging research area in robotics. Compared with rigid object manipulation, this problem is more complex due to the deformation properties including elastic, plastic, and elastoplastic deformation. In this paper, we describe a new deformable object manipulation method including soft contact simulation, manipulation learning, and sim-to-real tra…

    Submitted 12 May, 2024; originally announced May 2024.

  29. arXiv:2405.03329  [pdf, other]

    cs.LG stat.ML

    Policy Learning for Balancing Short-Term and Long-Term Rewards

    Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng

    Abstract: Empirical researchers and decision-makers spanning various domains frequently seek profound insights into the long-term impacts of interventions. While the significance of long-term outcomes is undeniable, an overemphasis on them may inadvertently overshadow short-term gains. Motivated by this, this paper formalizes a new framework for learning the optimal policy that effectively balances both lon…

    Submitted 6 May, 2024; originally announced May 2024.

  30. arXiv:2405.03181  [pdf, other]

    cs.DC

    Collaborative Satellite Computing through Adaptive DNN Task Splitting and Offloading

    Authors: Shifeng Peng, Xuefeng Hou, Zhishu Shen, Qiushi Zheng, Jiong Jin, Atsushi Tagami, Jingling Yuan

    Abstract: Satellite computing has emerged as a promising technology for next-generation wireless networks. This innovative technology provides data processing capabilities, which facilitates the widespread implementation of artificial intelligence (AI)-based applications, especially for image processing tasks involving deep neural network (DNN). With the limited computing resources of an individual satellit…

    Submitted 20 May, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted by 29th IEEE Symposium on Computers and Communications (ISCC)

  31. arXiv:2405.03060  [pdf, other]

    cs.LG

    Tree-based Ensemble Learning for Out-of-distribution Detection

    Authors: Zhaiming Shen, Menglun Wang, Guang Cheng, Ming-Jun Lai, Lin Mu, Ruihao Huang, Qi Liu, Hao Zhu

    Abstract: Being able to successfully determine whether the testing samples has similar distribution as the training samples is a fundamental question to address before we can safely deploy most of the machine learning models into practice. In this paper, we propose TOOD detection, a simple yet effective tree-based out-of-distribution (TOOD) detection mechanism to determine if a set of unseen samples will ha…

    Submitted 5 May, 2024; originally announced May 2024.

  32. arXiv:2405.02935  [pdf, other]

    cs.CL

    Enabling Patient-side Disease Prediction via the Integration of Patient Narratives

    Authors: Zhixiang Su, Yinan Zhang, Jiazheng Jing, Jie Xiao, Zhiqi Shen

    Abstract: Disease prediction holds considerable significance in modern healthcare, because of its crucial role in facilitating early intervention and implementing effective prevention measures. However, most recent disease prediction approaches heavily rely on laboratory test outcomes (e.g., blood tests and medical imaging from X-rays). Gaining access to such data for precise disease prediction is often a c…

    Submitted 5 May, 2024; originally announced May 2024.

  33. arXiv:2405.02914  [pdf, other]

    cs.RO

    Simulation of Optical Tactile Sensors Supporting Slip and Rotation using Path Tracing and IMPM

    Authors: Zirong Shen, Yuhao Sun, Shixin Zhang, Zixi Chen, Heyi Sun, Fuchun Sun, Bin Fang

    Abstract: Optical tactile sensors are extensively utilized in intelligent robot manipulation due to their ability to acquire high-resolution tactile information at a lower cost. However, achieving adequate reality and versatility in simulating optical tactile sensors is challenging. In this paper, we propose a simulation method and validate its effectiveness through experiments. We utilize path tracing for…

    Submitted 5 May, 2024; originally announced May 2024.

  34. arXiv:2405.01326  [pdf, other]

    cs.CV

    Multi-modal Learnable Queries for Image Aesthetics Assessment

    Authors: Zhiwei Xiong, Yunfan Zhang, Zhiqi Shen, Peiran Ren, Han Yu

    Abstract: Image aesthetics assessment (IAA) is attracting wide interest with the prevalence of social media. The problem is challenging due to its subjective and ambiguous nature. Instead of directly extracting aesthetic features solely from the image, user comments associated with an image could potentially provide complementary knowledge that is useful for IAA. With existing large-scale pre-trained models…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: Accepted by ICME2024

  35. arXiv:2405.00705  [pdf, other]

    cs.CL cs.LG

    SHED: Shapley-Based Automated Dataset Refinement for Instruction Fine-Tuning

    Authors: Yexiao He, Ziyao Wang, Zheyu Shen, Guoheng Sun, Yucong Dai, Yongkai Wu, Hongyi Wang, Ang Li

    Abstract: The pre-trained Large Language Models (LLMs) can be adapted for many downstream tasks and tailored to align with human preferences through fine-tuning. Recent studies have discovered that LLMs can achieve desirable performance with only a small amount of high-quality data, suggesting that a large amount of the data in these extensive datasets is redundant or even harmful. Identifying high-quality…

    Submitted 23 April, 2024; originally announced May 2024.

  36. arXiv:2404.19563  [pdf, other]

    cs.CL

    RepEval: Effective Text Evaluation with LLM Representation

    Authors: Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xinbing Wang, Chenghu Zhou

    Abstract: Automatic evaluation metrics for generated texts play an important role in the NLG field, especially with the rapid growth of LLMs. However, existing metrics are often limited to specific scenarios, making it challenging to meet the evaluation requirements of expanding LLM applications. Therefore, there is a demand for new, flexible, and effective metrics. In this study, we introduce RepEval, the…

    Submitted 30 April, 2024; originally announced April 2024.

  37. arXiv:2404.19372  [pdf, other]

    cs.NI

    AutoNet: Automatic Reachability Policy Management in Public Cloud Networks

    Authors: German Sviridov, Zheng Tao Shen, Jorge Cardoso

    Abstract: Virtual Private Cloud (VPC) is the main network abstraction technology used in public cloud systems. VPCs are composed of a set of network services that permit the definition of complex network reachability properties among internal and external cloud entities such as tenants' VMs or some generic internet nodes. Although hiding the underlying complexity through a comprehensible abstraction layer,…

    Submitted 30 April, 2024; originally announced April 2024.

  38. arXiv:2404.16322  [pdf, other]

    cs.DB

    Bridging Speed and Accuracy to Approximate $K$-Nearest Neighbor Search

    Authors: Mingyu Yang, Jiabao Jin, Xiangyu Wang, Zhitao Shen, Wei Jia, Wentao Li, Wei Wang

    Abstract: Approximate K-Nearest Neighbor (AKNN) search in high-dimensional spaces is a critical yet challenging problem. The efficiency of AKNN search largely depends on the computation of distances, a process that significantly affects the runtime. To improve computational efficiency, existing work often opts for estimating approximate distances rather than computing exact distances, at the cost of reduced…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 13 pages

  39. arXiv:2404.14979  [pdf, other]

    cs.CV cs.AI

    SGFormer: Spherical Geometry Transformer for 360 Depth Estimation

    Authors: Junsong Zhang, Zisong Chen, Chunyu Lin, Lang Nie, Zhijie Shen, Junda Huang, Yao Zhao

    Abstract: Panoramic distortion poses a significant challenge in 360 depth estimation, particularly pronounced at the north and south poles. Existing methods either adopt a bi-projection fusion strategy to remove distortions or model long-range dependencies to capture global structures, which can result in either unclear structure or insufficient local perception. In this paper, we propose a spherical geomet…

    Submitted 23 April, 2024; originally announced April 2024.

  40. arXiv:2404.13733  [pdf, other]

    cs.LG cs.AI cs.CV

    Elucidating the Design Space of Dataset Condensation

    Authors: Shitong Shao, Zikai Zhou, Huanran Chen, Zhiqiang Shen

    Abstract: Dataset condensation, a concept within data-centric learning, efficiently transfers critical attributes from an original dataset to a synthetic version, maintaining both diversity and realism. This approach significantly improves model training efficiency and is adaptable across multiple application areas. Previous methods in dataset condensation have faced challenges: some incur high computationa…

    Submitted 6 May, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  41. arXiv:2404.11121  [pdf, other]

    cs.CR cs.AI

    TransLinkGuard: Safeguarding Transformer Models Against Model Stealing in Edge Deployment

    Authors: Qinfeng Li, Zhiqiang Shen, Zhenghan Qin, Yangfan Xie, Xuhong Zhang, Tianyu Du, Jianwei Yin

    Abstract: Proprietary large language models (LLMs) have been widely applied in various scenarios. Additionally, deploying LLMs on edge devices is trending for efficiency and privacy reasons. However, edge deployment of proprietary LLMs introduces new security challenges: edge-deployed models are exposed as white-box accessible to users, enabling adversaries to conduct effective model stealing (MS) attacks.…

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.07152 by other authors

  42. arXiv:2404.11014  [pdf, other]

    cs.MA cs.AI

    Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

    Authors: Kang Wang, Zhishu Shen, Zhen Lei, Tiehua Zhang

    Abstract: Traffic signal control systems (TSCSs) are integral to intelligent traffic management, fostering efficient vehicle flow. Traditional approaches often simplify road networks into standard graphs, which results in a failure to consider the dynamic nature of traffic data at neighboring intersections, thereby neglecting higher-order interconnections necessary for real-time control. To address this, we…

    Submitted 16 April, 2024; originally announced April 2024.

  43. arXiv:2404.07976  [pdf, other]

    cs.CV cs.AI

    Self-supervised Dataset Distillation: A Good Compression Is All You Need

    Authors: Muxin Zhou, Zeyuan Yin, Shitong Shao, Zhiqiang Shen

    Abstract: Dataset distillation aims to compress information from a large-scale original dataset to a new compact dataset while striving to preserve the utmost degree of the original data informational essence. Previous studies have predominantly concentrated on aligning the intermediate statistics between the original and distilled data, such as weight trajectory, features, gradient, BatchNorm, etc. In this…

    Submitted 11 April, 2024; originally announced April 2024.

  44. arXiv:2404.04216  [pdf, other]

    cs.CE physics.comp-ph quant-ph

    Quantum-informed simulations for mechanics of materials: DFTB+MBD framework

    Authors: Zhaoxiang Shen, Raúl I. Sosa, Stéphane P. A. Bordas, Alexandre Tkatchenko, Jakub Lengiewicz

    Abstract: The macroscopic behaviors of materials are determined by interactions that occur at multiple length and time scales. Depending on the application, describing, predicting, and understanding these behaviors require models that rely on insights from electronic and atomic scales. In such cases, classical simplified approximations at those scales are insufficient, and quantum-based modeling is require…

    Submitted 5 April, 2024; originally announced April 2024.

  45. arXiv:2404.03604  [pdf, other]

    math.OC cs.DS

    A Unified Algorithmic Framework for Dynamic Assortment Optimization under MNL Choice

    Authors: Shuo Sun, Rajan Udwani, Zuo-Jun Max Shen

    Abstract: We consider assortment and inventory planning problems with dynamic stockout-based substitution effects and no replenishment. We consider two settings: 1. Customers can see all available products when they arrive, which is commonly seen in physical stores. 2. The seller can choose to offer a subset of available products to each customer, which is typical on online platforms. Both settings are know…

    Submitted 4 April, 2024; originally announced April 2024.

  46. arXiv:2403.15464  [pdf, other]

    cs.CL cs.AI cs.LG cs.MA

    LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction

    Authors: Hejie Cui, Zhuocheng Shen, Jieyu Zhang, Hui Shao, Lianhui Qin, Joyce C. Ho, Carl Yang

    Abstract: Electronic health records (EHRs) contain valuable patient data for health-related prediction tasks, such as disease prediction. Traditional approaches rely on supervised learning methods that require large labeled datasets, which can be expensive and challenging to obtain. In this study, we investigate the feasibility of applying Large Language Models (LLMs) to convert structured patient visit dat…

    Submitted 19 March, 2024; originally announced March 2024.

    ACM Class: J.3; I.2.7

  47. A Design Space for Intelligent and Interactive Writing Assistants

    Authors: Mina Lee, Katy Ilonka Gero, John Joon Young Chung, Simon Buckingham Shum, Vipul Raheja, Hua Shen, Subhashini Venugopalan, Thiemo Wambsganss, David Zhou, Emad A. Alghamdi, Tal August, Avinash Bhat, Madiha Zahrah Choksi, Senjuti Dutta, Jin L. C. Guo, Md Naimul Hoque, Yewon Kim, Simon Knight, Seyed Parsa Neshaei, Agnia Sergeyuk, Antonette Shibani, Disha Shrivastava, Lila Shroff, Jessi Stark, Sarah Sterman, et al. (11 additional authors not shown)

    Abstract: In our era of rapid technological advancement, the research landscape for writing assistants has become increasingly fragmented across various research communities. We seek to address this challenge by proposing a design space as a structured way to examine and explore the multidimensional space of intelligent and interactive writing assistants. Through a large community collaboration, we explore…

    Submitted 26 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: Published as a conference paper at CHI 2024

  48. arXiv:2403.10496  [pdf, other]

    cs.RO

    Reconfigurable Robot Identification from Motion Data

    Authors: Yuhang Hu, Yunzhe Wang, Ruibo Liu, Zhou Shen, Hod Lipson

    Abstract: Integrating Large Language Models (LLMs) and Vision-Language Models (VLMs) with robotic systems enables robots to process and understand complex natural language instructions and visual information. However, a fundamental challenge remains: for robots to fully capitalize on these advancements, they must have a deep understanding of their physical embodiment. The gap between AI models' cognitive cap…

    Submitted 15 March, 2024; originally announced March 2024.

  49. arXiv:2403.10415  [pdf, other]

    cs.AI

    Gradient based Feature Attribution in Explainable AI: A Technical Review

    Authors: Yongjie Wang, Tong Zhang, Xu Guo, Zhiqi Shen

    Abstract: The surge in black-box AI models has prompted the need to explain the internal mechanism and justify their reliability, especially in high-stakes applications, such as healthcare and autonomous driving. Due to the lack of a rigorous definition of explainable AI (XAI), a plethora of research related to explainability, interpretability, and transparency has been developed to explain and analyze the…

    Submitted 15 March, 2024; originally announced March 2024.

  50. arXiv:2403.09288  [pdf, other]

    cs.CV cs.AI

    Adversarial Training with OCR Modality Perturbation for Scene-Text Visual Question Answering

    Authors: Zhixuan Shen, Haonan Luo, Sijia Li, Tianrui Li

    Abstract: Scene-Text Visual Question Answering (ST-VQA) aims to understand scene text in images and answer questions related to the text content. Most existing methods heavily rely on the accuracy of Optical Character Recognition (OCR) systems, and aggressive fine-tuning based on limited spatial location information and erroneous OCR text information often leads to inevitable overfitting. In this paper, we…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 6 pages, 3 figures, accepted by 2024 IEEE International Conference on Multimedia and Expo