Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 480 results for author: Chang, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.16252  [pdf, other

    cs.CL cs.AI cs.CV

    LawLuo: A Chinese Law Firm Co-run by LLM Agents

    Authors: Jingyun Sun, Chengxiao Dai, Zhongze Luo, Yangbo Chang, Yang Li

    Abstract: Large Language Models (LLMs) demonstrate substantial potential in delivering legal consultation services to users without a legal background, attributed to their superior text comprehension and generation capabilities. Nonetheless, existing Chinese legal LLMs limit interaction to a single model-user dialogue, unlike the collaborative consultations typical of law firms, where multiple staff members… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 11 pages, 13 figures, 2 tables

    ACM Class: I.2.1

  2. arXiv:2407.16095  [pdf, other

    cs.RO

    Robotically adjustable kinematics in a wrist-driven orthosis eases grasping across tasks

    Authors: Erin Y. Chang, Andrew I. W. McPherson, Hannah S. Stuart

    Abstract: Without finger function, people with C5-7 spinal cord injury (SCI) regularly utilize wrist extension to passively close the fingers and thumb together for grasping. Wearable assistive grasping devices often focus on this familiar wrist-driven technique to provide additional support and amplify grasp force. Despite recent research advances in modernizing these tools, people with SCI often abandon s… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 6 pages, 8 figures. Presented at the 2024 International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

  3. arXiv:2407.14136  [pdf, other

    cs.CV

    6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry

    Authors: Sungho Chun, Ju Yong Chang

    Abstract: This study addresses the nuanced challenge of estimating head translations within the context of six-degrees-of-freedom (6DoF) head pose estimation, placing emphasis on this aspect over the more commonly studied head rotations. Identifying a gap in existing methodologies, we recognized the underutilized potential synergy between facial geometry and head translation. To bridge this gap, we propose… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  4. arXiv:2407.12068  [pdf, other

    cs.LG cs.AI

    Learning on Graphs with Large Language Models(LLMs): A Deep Dive into Model Robustness

    Authors: Kai Guo, Zewen Liu, Zhikai Chen, Hongzhi Wen, Wei Jin, Jiliang Tang, Yi Chang

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various natural language processing tasks. Recently, several LLMs-based pipelines have been developed to enhance learning on graphs with text attributes, showcasing promising performance. However, graphs are well-known to be susceptible to adversarial attacks and it remains unclear whether LLMs exhibit robustness in learn… ▽ More

    Submitted 28 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

  5. arXiv:2407.10051  [pdf, ps, other

    cs.IT

    The weight distributions of some linear codes derived from Kloosterman sums

    Authors: Mengzhen Zhao, Yanxun Chang

    Abstract: Linear codes with few weights have applications in data storage systems, secret sharing schemes, and authentication codes. In this paper, some kinds of p-ary linear codes with few weights are constructed by use of the given de ning set, where p is a prime. Their weight distributions are determined based on Kloosterman sums over nite elds. In addition, some linear codes we given is minimal.

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: 11 pages

    MSC Class: 94A24; 94B05

  6. arXiv:2407.08377  [pdf, other

    cs.CV

    Long-range Turbulence Mitigation: A Large-scale Dataset and A Coarse-to-fine Framework

    Authors: Shengqi Xu, Run Sun, Yi Chang, Shuning Cao, Xueyao Xiao, Luxin Yan

    Abstract: Long-range imaging inevitably suffers from atmospheric turbulence with severe geometric distortions due to random refraction of light. The further the distance, the more severe the disturbance. Despite existing research has achieved great progress in tackling short-range turbulence, there is less attention paid to long-range turbulence with significant distortions. To address this dilemma and adva… ▽ More

    Submitted 17 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by ECCV 2024

  7. arXiv:2407.05364  [pdf, other

    cs.LG

    PTaRL: Prototype-based Tabular Representation Learning via Space Calibration

    Authors: Hangting Ye, Wei Fan, Xiaozhuang Song, Shun Zheng, He Zhao, Dandan Guo, Yi Chang

    Abstract: Tabular data have been playing a mostly important role in diverse real-world fields, such as healthcare, engineering, finance, etc. With the recent success of deep learning, many tabular machine learning (ML) methods based on deep networks (e.g., Transformer, ResNet) have achieved competitive performance on tabular benchmarks. However, existing deep tabular ML methods suffer from the representatio… ▽ More

    Submitted 15 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted by ICLR 2024

  8. arXiv:2407.03925  [pdf, other

    cs.LG

    Reduced-Order Neural Operators: Learning Lagrangian Dynamics on Highly Sparse Graphs

    Authors: Hrishikesh Viswanath, Yue Chang, Julius Berner, Peter Yichen Chen, Aniket Bera

    Abstract: We present a neural operator architecture to simulate Lagrangian dynamics, such as fluid flow, granular flows, and elastoplasticity. Traditional numerical methods, such as the finite element method (FEM), suffer from long run times and large memory consumption. On the other hand, approaches based on graph neural networks are faster but still suffer from long computation times on dense graphs, whic… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  9. arXiv:2407.00383  [pdf, other

    cs.LG cs.AI

    FANFOLD: Graph Normalizing Flows-driven Asymmetric Network for Unsupervised Graph-Level Anomaly Detection

    Authors: Rui Cao, Shijie Xue, Jindong Li, Qi Wang, Yi Chang

    Abstract: Unsupervised graph-level anomaly detection (UGAD) has attracted increasing interest due to its widespread application. In recent studies, knowledge distillation-based methods have been widely used in unsupervised anomaly detection to improve model efficiency and generalization. However, the inherent symmetry between the source (teacher) and target (student) networks typically results in consistent… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  10. arXiv:2407.00118  [pdf, other

    cs.LG cs.AI

    From Efficient Multimodal Models to World Models: A Survey

    Authors: Xinji Mai, Zeng Tao, Junxiong Lin, Haoran Wang, Yang Chang, Yanlan Kang, Yan Wang, Wenqiang Zhang

    Abstract: Multimodal Large Models (MLMs) are becoming a significant research focus, combining powerful large language models with multimodal learning to perform complex tasks across different data modalities. This review explores the latest developments and challenges in MLMs, emphasizing their potential in achieving artificial general intelligence and as a pathway to world models. We provide an overview of… ▽ More

    Submitted 27 June, 2024; originally announced July 2024.

  11. arXiv:2406.19228  [pdf, other

    cs.CL cs.AI cs.LG

    Tools Fail: Detecting Silent Errors in Faulty Tools

    Authors: Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk

    Abstract: Tools have become a mainstay of LLMs, allowing them to retrieve knowledge not in their weights, to perform tasks on the web, and even to control robots. However, most ontologies and surveys of tool-use have assumed the core challenge for LLMs is choosing the tool. Instead, we introduce a framework for tools more broadly which guides us to explore a model's ability to detect "silent" tool errors, a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 18 pages, 12 figures

  12. arXiv:2406.17386  [pdf, other

    math.OC cs.AI cs.LG

    Double Momentum Method for Lower-Level Constrained Bilevel Optimization

    Authors: Wanli Shi, Yi Chang, Bin Gu

    Abstract: Bilevel optimization (BO) has recently gained prominence in many machine learning applications due to its ability to capture the nested structure inherent in these problems. Recently, many hypergradient methods have been proposed as effective solutions for solving large-scale problems. However, current hypergradient methods for the lower-level constrained bilevel optimization (LCBO) problems need… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 27pages, 9 figures

  13. arXiv:2406.15848  [pdf, other

    cs.CV

    Quality-guided Skin Tone Enhancement for Portrait Photography

    Authors: Shiqi Gao, Huiyu Duan, Xinyue Li, Kang Fu, Yicong Peng, Qihang Xu, Yuanyuan Chang, Jia Wang, Xiongkuo Min, Guangtao Zhai

    Abstract: In recent years, learning-based color and tone enhancement methods for photos have become increasingly popular. However, most learning-based image enhancement methods just learn a mapping from one distribution to another based on one dataset, lacking the ability to adjust images continuously and controllably. It is important to enable the learning-based enhancement models to adjust an image contin… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  14. arXiv:2406.15119  [pdf, other

    cs.SD cs.AI eess.AS

    Speech Emotion Recognition under Resource Constraints with Data Distillation

    Authors: Yi Chang, Zhao Ren, Zhonghao Zhao, Thanh Tam Nguyen, Kun Qian, Tanja Schultz, Björn W. Schuller

    Abstract: Speech emotion recognition (SER) plays a crucial role in human-computer interaction. The emergence of edge devices in the Internet of Things (IoT) presents challenges in constructing intricate deep learning models due to constraints in memory and computational resources. Moreover, emotional speech data often contains private information, raising concerns about privacy leakage during the deployment… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

  15. arXiv:2406.14517  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    PostMark: A Robust Blackbox Watermark for Large Language Models

    Authors: Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Wieting, Mohit Iyyer

    Abstract: The most effective techniques to detect LLM-generated text rely on inserting a detectable signature -- or watermark -- during the model's decoding process. Most existing watermarking methods require access to the underlying LLM's logits, which LLM API providers are loath to share due to fears of model distillation. As such, these watermarks must be implemented independently by each LLM provider. I… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: preprint; 18 pages, 5 figures

  16. arXiv:2406.13153  [pdf, other

    cs.CV

    SwinStyleformer is a favorable choice for image inversion

    Authors: Jiawei Mao, Guangyi Zhao, Xuesong Yin, Yuanqi Chang

    Abstract: This paper proposes the first pure Transformer structure inversion network called SwinStyleformer, which can compensate for the shortcomings of the CNNs inversion framework by handling long-range dependencies and learning the global structure of objects. Experiments found that the inversion network with the Transformer backbone could not successfully invert the image. The above phenomena arise fro… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  17. arXiv:2406.12587  [pdf, other

    cs.CV

    Restorer: Solving Multiple Image Restoration Tasks with One Set of Parameters

    Authors: Jiawei Mao, Xuesong Yin, Yuanqi Chang

    Abstract: Although there are many excellent solutions in image restoration, the fact that they are specifically designed for a single image restoration task may prevent them from being state-of-the-art (SOTA) in other types of image restoration tasks. While some approaches require considering multiple image restoration tasks, they are still not sufficient for the requirements of the real world and may suffe… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  18. arXiv:2406.12585  [pdf, other

    cs.CL cs.AI

    Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling

    Authors: Yao-Ching Yu, Chun-Chih Kuo, Ziqi Ye, Yu-Cheng Chang, Yueh-Se Li

    Abstract: Ensembling multiple models has always been an effective approach to push the limits of existing performance and is widely used in classification tasks by simply averaging the classification probability vectors from multiple classifiers to achieve better accuracy. However, in the thriving open-source Large Language Model (LLM) community, ensembling methods are rare and typically limited to ensembli… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  19. arXiv:2406.10272  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Connected Speech-Based Cognitive Assessment in Chinese and English

    Authors: Saturnino Luz, Sofia De La Fuente Garcia, Fasih Haider, Davida Fromm, Brian MacWhinney, Alyssa Lanzi, Ya-Ning Chang, Chia-Ju Chou, Yi-Chien Liu

    Abstract: We present a novel benchmark dataset and prediction tasks for investigating approaches to assess cognitive function through analysis of connected speech. The dataset consists of speech samples and clinical information for speakers of Mandarin Chinese and English with different levels of cognitive impairment as well as individuals with normal cognition. These data have been carefully matched by age… ▽ More

    Submitted 18 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: To appear in Proceedings of Interspeech 2024

    ACM Class: J.3; I.5.4

  20. arXiv:2406.08855  [pdf, other

    cs.RO

    Trajectory Planning for Autonomous Driving in Unstructured Scenarios Based on Graph Neural Network and Numerical Optimization

    Authors: Sumin Zhang, Kuo Li, Rui He, Zhiwei Meng, Yupeng Chang, Xiaosong Jin, Ri Bai

    Abstract: In unstructured environments, obstacles are diverse and lack lane markings, making trajectory planning for intelligent vehicles a challenging task. Traditional trajectory planning methods typically involve multiple stages, including path planning, speed planning, and trajectory optimization. These methods require the manual design of numerous parameters for each stage, resulting in significant wor… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  21. arXiv:2406.05191  [pdf, other

    cs.CV

    DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

    Authors: Shaurya Dewan, Rushikesh Zawar, Prakanshul Saxena, Yingshan Chang, Andrew Luo, Yonatan Bisk

    Abstract: Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models have achieved remarkable success, the underlying mechanisms driving their performance are not yet fully accounted for, with many unanswered questions surrounding w… ▽ More

    Submitted 12 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  22. arXiv:2406.04368  [pdf, other

    cs.CL cs.AI cs.CY

    SocialNLP Fake-EmoReact 2021 Challenge Overview: Predicting Fake Tweets from Their Replies and GIFs

    Authors: Chien-Kun Huang, Yi-Ting Chang, Lun-Wei Ku, Cheng-Te Li, Hong-Han Shuai

    Abstract: This paper provides an overview of the Fake-EmoReact 2021 Challenge, held at the 9th SocialNLP Workshop, in conjunction with NAACL 2021. The challenge requires predicting the authenticity of tweets using reply context and augmented GIF categories from EmotionGIF dataset. We offer the Fake-EmoReact dataset with more than 453k as the experimental materials, where every tweet is labeled with authenti… ▽ More

    Submitted 31 May, 2024; originally announced June 2024.

  23. arXiv:2406.03102  [pdf, other

    cs.LG cs.AI

    DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays

    Authors: Bo Xia, Yilun Kong, Yongzhe Chang, Bo Yuan, Zhiheng Li, Xueqian Wang, Bin Liang

    Abstract: Classic reinforcement learning (RL) frequently confronts challenges in tasks involving delays, which cause a mismatch between received observations and subsequent actions, thereby deviating from the Markov assumption. Existing methods usually tackle this issue with end-to-end solutions using state augmentation. However, these black-box approaches often involve incomprehensible processes and redund… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  24. arXiv:2406.01033  [pdf

    cs.CV cs.LG cs.MM

    Generalized Jersey Number Recognition Using Multi-task Learning With Orientation-guided Weight Refinement

    Authors: Yung-Hui Lin, Yu-Wen Chang, Huang-Chia Shih, Takahiro Ogawa

    Abstract: Jersey number recognition (JNR) has always been an important task in sports analytics. Improving recognition accuracy remains an ongoing challenge because images are subject to blurring, occlusion, deformity, and low resolution. Recent research has addressed these problems using number localization and optical character recognition. Some approaches apply player identification schemes to image sequ… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 10 pages, 6 figures, 5 tables

  25. arXiv:2405.20404  [pdf, other

    cs.CL cs.LG

    XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution

    Authors: Yurui Chang, Bochuan Cao, Yujia Wang, Jinghui Chen, Lu Lin

    Abstract: Large Language Models (LLMs) have demonstrated impressive performances in complex text generation tasks. However, the contribution of the input prompt to the generated content still remains obscure to humans, underscoring the necessity of elucidating and explaining the causality between input and output pairs. Existing works for providing prompt-specific explanation often confine model output to b… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  26. arXiv:2405.20131  [pdf, other

    cs.CL cs.AI

    Language Models Need Inductive Biases to Count Inductively

    Authors: Yingshan Chang, Yonatan Bisk

    Abstract: Counting is a fundamental example of generalization, whether viewed through the mathematical lens of Peano's axioms defining the natural numbers or the cognitive science literature for children learning to count. The argument holds for both cases that learning to count means learning to count infinitely. While few papers have tried to distill transformer "reasoning" to the simplest case of countin… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  27. arXiv:2405.19718  [pdf, other

    cs.CV

    LED: A Large-scale Real-world Paired Dataset for Event Camera Denoising

    Authors: Yuxing Duan, Shihan Peng, Lin Zhu, Wei Zhang, Yi Chang, Sheng Zhong, Luxin Yan

    Abstract: Event camera has significant advantages in capturing dynamic scene information while being prone to noise interference, particularly in challenging conditions like low threshold and low illumination. However, most existing research focuses on gentle situations, hindering event camera applications in realistic complex scenarios. To tackle this limitation and advance the field, we construct a new pa… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2024

  28. arXiv:2405.18816  [pdf, other

    cs.CV cs.LG

    Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

    Authors: Yasi Zhang, Peiyu Yu, Yaxuan Zhu, Yingshan Chang, Feng Gao, Ying Nian Wu, Oscar Leong

    Abstract: Generative models based on flow matching have attracted significant attention for their simplicity and superior performance in high-resolution image synthesis. By leveraging the instantaneous change-of-variables formula, one can directly compute image likelihoods from a learned flow, making them enticing candidates as priors for downstream tasks such as inverse problems. In particular, a natural a… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  29. arXiv:2405.16755  [pdf, other

    cs.LG cs.AI cs.DB

    CHESS: Contextual Harnessing for Efficient SQL Synthesis

    Authors: Shayan Talaei, Mohammadreza Pourreza, Yu-Chen Chang, Azalia Mirhoseini, Amin Saberi

    Abstract: Utilizing large language models (LLMs) for transforming natural language questions into SQL queries (text-to-SQL) is a promising yet challenging approach, particularly when applied to real-world databases with complex and extensive schemas. In particular, effectively incorporating data catalogs and database values for SQL generation remains an obstacle, leading to suboptimal solutions. We address… ▽ More

    Submitted 27 June, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  30. arXiv:2405.15808  [pdf, other

    cs.AI

    Ensuring Ground Truth Accuracy in Healthcare with the EVINCE framework

    Authors: Edward Y. Chang

    Abstract: Misdiagnosis is a significant issue in healthcare, leading to harmful consequences for patients. The propagation of mislabeled data through machine learning models into clinical practice is unacceptable. This paper proposes EVINCE, a system designed to 1) improve diagnosis accuracy and 2) rectify misdiagnoses and minimize training data errors. EVINCE stands for Entropy Variation through Informatio… ▽ More

    Submitted 28 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 23 pages, 4 tables, 4 figures

    ACM Class: I.2.7

  31. arXiv:2405.14573  [pdf, other

    cs.AI cs.LG

    AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents

    Authors: Christopher Rawles, Sarah Clinckemaillie, Yifan Chang, Jonathan Waltz, Gabrielle Lau, Marybeth Fair, Alice Li, William Bishop, Wei Li, Folawiyo Campbell-Ajala, Daniel Toyama, Robert Berry, Divya Tyamagundlu, Timothy Lillicrap, Oriana Riva

    Abstract: Autonomous agents that execute human tasks by controlling computers can enhance human productivity and application accessibility. However, progress in this field will be driven by realistic and reproducible benchmarks. We present AndroidWorld, a fully functional Android environment that provides reward signals for 116 programmatic tasks across 20 real-world Android apps. Unlike existing interactiv… ▽ More

    Submitted 10 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  32. arXiv:2405.13368  [pdf, other

    cs.IT

    Static Deep Q-learning for Green Downlink C-RAN

    Authors: Yuchao Chang, Hongli Wang, Wen Chen, Yonghui Li, Naofal Al-Dhahir

    Abstract: Power saving is a main pillar in the operation of wireless communication systems. In this paper, we investigate cloud radio access network (C-RAN) capability to reduce power consumption based on the user equipment (UE) requirement. Aiming to save the long-term C-RAN energy consumption, an optimization problem is formulated to manage the downlink power without degrading the UE requirement by design… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  33. arXiv:2405.12954  [pdf, other

    cs.LG cs.AI

    A Method on Searching Better Activation Functions

    Authors: Haoyuan Sun, Zihao Wu, Bo Xia, Pu Chang, Zibin Dong, Yifu Yuan, Yongzhe Chang, Xueqian Wang

    Abstract: The success of artificial neural networks (ANNs) hinges greatly on the judicious selection of an activation function, introducing non-linearity into network and enabling them to model sophisticated relationships in data. However, the search of activation functions has largely relied on empirical knowledge in the past, lacking theoretical guidance, which has hindered the identification of more effe… ▽ More

    Submitted 22 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: 16 pages,3 figures

  34. arXiv:2405.10516  [pdf, other

    cs.CL cs.AI

    Language Models can Evaluate Themselves via Probability Discrepancy

    Authors: Tingyu Xia, Bowen Yu, Yuan Wu, Yi Chang, Chang Zhou

    Abstract: In this paper, we initiate our discussion by demonstrating how Large Language Models (LLMs), when tasked with responding to queries, display a more even probability distribution in their answers if they are more adept, as opposed to their less skilled counterparts. Expanding on this foundational insight, we propose a new self-evaluation method ProbDiff for assessing the efficacy of various LLMs. T… ▽ More

    Submitted 8 July, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: ACL 2024 Findings

  35. arXiv:2405.07076  [pdf, other

    cs.CL cs.AI

    Integrating Emotional and Linguistic Models for Ethical Compliance in Large Language Models

    Authors: Edward Y. Chang

    Abstract: This research develops advanced methodologies for Large Language Models (LLMs) to better manage linguistic behaviors related to emotions and ethics. We introduce DIKE, an adversarial framework that enhances the LLMs' ability to internalize and reflect global human values, adapting to varied cultural contexts to promote transparency and trust among users. The methodology involves detailed modeling… ▽ More

    Submitted 13 May, 2024; v1 submitted 11 May, 2024; originally announced May 2024.

    Comments: 29 pages, 10 tables, 6 figures

    ACM Class: I.2.7

  36. arXiv:2405.05132  [pdf, other

    cs.DC cs.DS

    Low-Distortion Clustering in Bounded Growth Graphs

    Authors: Yi-Jun Chang, Varsha Dani, Thomas P. Hayes

    Abstract: The well-known clustering algorithm of Miller, Peng, and Xu (SPAA 2013) is useful for many applications, including low-diameter decomposition and low-energy distributed algorithms. One nice property of their clustering, shown in previous work by Chang, Dani, Hayes, and Pettie (PODC 2020), is that distances in the cluster graph are rescaled versions of distances in the original graph, up to an… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  37. arXiv:2405.03908  [pdf, other

    cs.DC cs.DS

    Deterministic Expander Routing: Faster and More Versatile

    Authors: Yi-Jun Chang, Shang-En Huang, Hsin-Hao Su

    Abstract: We consider the expander routing problem formulated by Ghaffari, Kuhn, and Su (PODC 2017), where the goal is to route all the tokens to their destinations given that each vertex is the source and the destination of at most $°(v)$ tokens. They developed $\textit{randomized algorithms}$ that solve this problem in $\text{poly}(φ^{-1}) \cdot 2^{O(\sqrt{\log n \log \log n})}$ rounds in the… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: Accepted to PODC 2024

  38. arXiv:2405.03371  [pdf, other

    cs.CL

    Explainable Fake News Detection With Large Language Model via Defense Among Competing Wisdom

    Authors: Bo Wang, Jing Ma, Hongzhan Lin, Zhiwei Yang, Ruichao Yang, Yuan Tian, Yi Chang

    Abstract: Most fake news detection methods learn latent feature representations based on neural networks, which makes them black boxes to classify a piece of news without giving any justification. Existing explainable systems generate veracity justifications from investigative journalism, which suffer from debunking delayed and low efficiency. Recent studies simply assume that the justification is equivalen… ▽ More

    Submitted 20 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 12 pages, WWW'2024

  39. arXiv:2405.02814  [pdf, other

    cs.CL

    NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional Stimuli

    Authors: Xu Wang, Cheng Li, Yi Chang, Jindong Wang, Yuan Wu

    Abstract: Large Language Models (LLMs) have become integral to a wide spectrum of applications, ranging from traditional computing tasks to advanced artificial intelligence (AI) applications. This widespread adoption has spurred extensive research into LLMs across various disciplines, including the social sciences. Notably, studies have revealed that LLMs possess emotional intelligence, which can be further… ▽ More

    Submitted 12 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: This paper has been accepted by IJCAI 2024

  40. arXiv:2405.02695  [pdf, ps, other

    cs.DS cs.DC

    Improved All-Pairs Approximate Shortest Paths in Congested Clique

    Authors: Hong Duc Bui, Shashwat Chandra, Yi-Jun Chang, Michal Dory, Dean Leitersdorf

    Abstract: In this paper, we present new algorithms for approximating All-Pairs Shortest Paths (APSP) in the Congested Clique model. We present randomized algorithms for weighted undirected graphs. Our first contribution is an $O(1)$-approximate APSP algorithm taking just $O(\log \log \log n)$ rounds. Prior to our work, the fastest algorithms that give an $O(1)$-approximation for APSP take… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  41. CVTGAD: Simplified Transformer with Cross-View Attention for Unsupervised Graph-level Anomaly Detection

    Authors: Jindong Li, Qianli Xing, Qi Wang, Yi Chang

    Abstract: Unsupervised graph-level anomaly detection (UGAD) has received remarkable performance in various critical disciplines, such as chemistry analysis and bioinformatics. Existing UGAD paradigms often adopt data augmentation techniques to construct multiple views, and then employ different strategies to obtain representations from different views for jointly conducting UGAD. However, most previous work… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  42. arXiv:2405.02288  [pdf, other

    cs.CV cs.AI cs.RO

    Prospective Role of Foundation Models in Advancing Autonomous Vehicles

    Authors: Jianhua Wu, Bingzhao Gao, Jincheng Gao, Jianhao Yu, Hongqing Chu, Qiankun Yu, Xun Gong, Yi Chang, H. Eric Tseng, Hong Chen, Jie Chen

    Abstract: With the development of artificial intelligence and breakthroughs in deep learning, large-scale Foundation Models (FMs), such as GPT, Sora, etc., have achieved remarkable results in many fields including natural language processing and computer vision. The application of FMs in autonomous driving holds considerable promise. For example, they can contribute to enhancing scene understanding and reas… ▽ More

    Submitted 17 May, 2024; v1 submitted 8 December, 2023; originally announced May 2024.

    Comments: 45 pages,8 figures

  43. arXiv:2404.13696  [pdf, other

    cs.RO

    Clio: Real-time Task-Driven Open-Set 3D Scene Graphs

    Authors: Dominic Maggio, Yun Chang, Nathan Hughes, Matthew Trang, Dan Griffith, Carlyn Dougherty, Eric Cristofalo, Lukas Schmid, Luca Carlone

    Abstract: Modern tools for class-agnostic image segmentation (e.g., SegmentAnything) and open-set semantic understanding (e.g., CLIP) provide unprecedented opportunities for robot perception and mapping. While traditional closed-set metric-semantic maps were restricted to tens or hundreds of semantic classes, we can now build maps with a plethora of objects and countless semantic variations. This leaves us… ▽ More

    Submitted 29 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  44. arXiv:2404.13071  [pdf, other

    cs.CL cs.AI

    Modeling Emotions and Ethics with Large Language Models

    Authors: Edward Y. Chang

    Abstract: This paper explores the integration of human-like emotions and ethical considerations into Large Language Models (LLMs). We first model eight fundamental human emotions, presented as opposing pairs, and employ collaborative LLMs to reinterpret and express these emotions across a spectrum of intensity. Our focus extends to embedding a latent ethical dimension within LLMs, guided by a novel self-sup… ▽ More

    Submitted 25 June, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 8 pages, 4 figures, 3 tables

    ACM Class: I.2.0

    Journal ref: IEEE MIPR 2024

  45. arXiv:2404.12930  [pdf, other

    cs.DC cs.DS

    Fast Broadcast in Highly Connected Networks

    Authors: Shashwat Chandra, Yi-Jun Chang, Michal Dory, Mohsen Ghaffari, Dean Leitersdorf

    Abstract: We revisit the classic broadcast problem, wherein we have $k$ messages, each composed of $O(\log{n})$ bits, distributed arbitrarily across a network. The objective is to broadcast these messages to all nodes in the network. In the distributed CONGEST model, a textbook algorithm solves this problem in $O(D+k)$ rounds, where $D$ is the diameter of the graph. While the $O(D)$ term in the round comple… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  46. arXiv:2404.12445  [pdf

    cs.LG cs.CE physics.chem-ph

    Adaptive Catalyst Discovery Using Multicriteria Bayesian Optimization with Representation Learning

    Authors: Jie Chen, Pengfei Ou, Yuxin Chang, Hengrui Zhang, Xiao-Yan Li, Edward H. Sargent, Wei Chen

    Abstract: High-performance catalysts are crucial for sustainable energy conversion and human health. However, the discovery of catalysts faces challenges due to the absence of efficient approaches to navigating vast and high-dimensional structure and composition spaces. In this study, we propose a high-throughput computational catalyst screening approach integrating density functional theory (DFT) and Bayes… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  47. arXiv:2404.11884  [pdf, other

    cs.CV

    Seeing Motion at Nighttime with an Event Camera

    Authors: Haoyue Liu, Shihan Peng, Lin Zhu, Yi Chang, Hanyu Zhou, Luxin Yan

    Abstract: We focus on a very challenging task: imaging at nighttime dynamic scenes. Most previous methods rely on the low-light enhancement of a conventional RGB camera. However, they would inevitably face a dilemma between the long exposure time of nighttime and the motion blur of dynamic scenes. Event cameras react to dynamic changes with higher temporal resolution (microsecond) and higher dynamic range (… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  48. arXiv:2404.01598  [pdf, other

    cs.LG cs.AI cs.RO

    Extremum-Seeking Action Selection for Accelerating Policy Optimization

    Authors: Ya-Chien Chang, Sicun Gao

    Abstract: Reinforcement learning for control over continuous spaces typically uses high-entropy stochastic policies, such as Gaussian distributions, for local exploration and estimating policy gradient to optimize performance. Many robotic control problems deal with complex unstable dynamics, where applying actions that are off the feasible control manifolds can quickly lead to undesirable divergence. In su… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  49. arXiv:2404.01261  [pdf, other

    cs.CL cs.AI

    FABLES: Evaluating faithfulness and content selection in book-length summarization

    Authors: Yekyung Kim, Yapei Chang, Marzena Karpinska, Aparna Garimella, Varun Manjunatha, Kyle Lo, Tanya Goyal, Mohit Iyyer

    Abstract: While long-context large language models (LLMs) can technically summarize book-length documents (>100K tokens), the length and complexity of the documents have so far prohibited evaluations of input-dependent aspects like faithfulness. In this paper, we conduct the first large-scale human evaluation of faithfulness and content selection on LLM-generated summaries of fictional books. Our study miti… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: preprint - 39 pages

  50. arXiv:2403.16394  [pdf, other

    cs.CL cs.AI

    Skews in the Phenomenon Space Hinder Generalization in Text-to-Image Generation

    Authors: Yingshan Chang, Yasi Zhang, Zhiyuan Fang, Yingnian Wu, Yonatan Bisk, Feng Gao

    Abstract: The literature on text-to-image generation is plagued by issues of faithfully composing entities with relations. But there lacks a formal understanding of how entity-relation compositions can be effectively learned. Moreover, the underlying phenomenon space that meaningfully reflects the problem structure is not well-defined, leading to an arms race for larger quantities of data in the hope that g… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.