Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 130 results for author: Yin, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12142  [pdf, other

    cs.CL cs.AI

    MDD-5k: A New Diagnostic Conversation Dataset for Mental Disorders Synthesized via Neuro-Symbolic LLM Agents

    Authors: Congchi Yin, Feng Li, Shu Zhang, Zike Wang, Jun Shao, Piji Li, Jianhua Chen, Xun Jiang

    Abstract: The clinical diagnosis of most mental disorders primarily relies on the conversations between psychiatrist and patient. The creation of such diagnostic conversation datasets is promising to boost the AI mental healthcare community. However, directly collecting the conversations in real diagnosis scenarios is near impossible due to stringent privacy and ethical considerations. To address this issue… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  2. arXiv:2408.08328  [pdf, other

    cs.AI cs.LG stat.AP

    Unleash The Power of Pre-Trained Language Models for Irregularly Sampled Time Series

    Authors: Weijia Zhang, Chenlong Yin, Hao Liu, Hui Xiong

    Abstract: Pre-trained Language Models (PLMs), such as ChatGPT, have significantly advanced the field of natural language processing. This progress has inspired a series of innovative studies that explore the adaptation of PLMs to time series analysis, intending to create a unified foundation model that addresses various time series analytical tasks. However, these efforts predominantly focus on Regularly Sa… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  3. arXiv:2408.03586  [pdf, other

    cs.HC

    Clinical Challenges and AI Opportunities in Decision-Making for Cancer Treatment-Induced Cardiotoxicity

    Authors: Siyi Wu, Weidan Cao, Shihan Fu, Bingsheng Yao, Ziqi Yang, Changchang Yin, Varun Mishra, Daniel Addison, Ping Zhang, Dakuo Wang

    Abstract: Cardiotoxicity induced by cancer treatment has become a major clinical concern, affecting the long-term survival and quality of life of cancer patients. Effective clinical decision-making, including the detection of cancer treatment-induced cardiotoxicity and the monitoring of associated symptoms, remains a challenging task for clinicians. This study investigates the current practices and needs of… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: In Submission

  4. arXiv:2407.16999  [pdf, other

    cs.LG cs.AI cs.HC

    SepsisLab: Early Sepsis Prediction with Uncertainty Quantification and Active Sensing

    Authors: Changchang Yin, Pin-Yu Chen, Bingsheng Yao, Dakuo Wang, Jeffrey Caterino, Ping Zhang

    Abstract: Sepsis is the leading cause of in-hospital mortality in the USA. Early sepsis onset prediction and diagnosis could significantly improve the survival of sepsis patients. Existing predictive models are usually trained on high-quality data with few missing information, while missing values widely exist in real-world clinical scenarios (especially in the first hours of admissions to the hospital), wh… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: To be published in KDD 2024

    MSC Class: 68T07 (primary) 92C50 (secondary) ACM Class: H.2.8; I.2.1; J.3

  5. arXiv:2407.16237  [pdf, other

    cs.AR cs.AI cs.LG

    OriGen:Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection

    Authors: Fan Cui, Chenyang Yin, Kexing Zhou, Youwei Xiao, Guangyu Sun, Qiang Xu, Qipeng Guo, Demin Song, Dahua Lin, Xingcheng Zhang, Yun, Liang

    Abstract: Recent studies have demonstrated the significant potential of Large Language Models (LLMs) in generating Register Transfer Level (RTL) code, with notable advancements showcased by commercial models such as GPT-4 and Claude3-Opus. However, these proprietary LLMs often raise concerns regarding privacy and security. While open-source LLMs offer solutions to these concerns, they typically underperform… ▽ More

    Submitted 2 September, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  6. arXiv:2407.12768  [pdf, other

    quant-ph cs.CC cs.IT math-ph physics.atom-ph

    A polynomial-time classical algorithm for noisy quantum circuits

    Authors: Thomas Schuster, Chao Yin, Xun Gao, Norman Y. Yao

    Abstract: We provide a polynomial-time classical algorithm for noisy quantum circuits. The algorithm computes the expectation value of any observable for any circuit, with a small average error over input states drawn from an ensemble (e.g. the computational basis). Our approach is based upon the intuition that noise exponentially damps non-local correlations relative to local correlations. This enables one… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 11 pages, 3 figures + 22 page Supplementary Information

  7. arXiv:2407.04272  [pdf, other

    cs.LG cs.DC

    Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

    Authors: Hao Feng, Boyuan Zhang, Fanjiang Ye, Min Si, Ching-Hsiang Chu, Jiannan Tian, Chunxing Yin, Summer Deng, Yuchen Hao, Pavan Balaji, Tong Geng, Dingwen Tao

    Abstract: DLRM is a state-of-the-art recommendation system model that has gained widespread adoption across various industry applications. The large size of DLRM models, however, necessitates the use of multiple devices/GPUs for efficient training. A significant bottleneck in this process is the time-consuming all-to-all communication required to collect embedding data from all devices. To mitigate this, we… ▽ More

    Submitted 25 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: camera-ready version for SC '24

  8. arXiv:2407.02730  [pdf, other

    cs.CV cs.AI

    MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context

    Authors: Zishan Gu, Changchang Yin, Fenglin Liu, Ping Zhang

    Abstract: Large Vision Language Models (LVLMs) have recently achieved superior performance in various tasks on natural image and text data, which inspires a large amount of studies for LVLMs fine-tuning and training. Despite their advancements, there has been scant research on the robustness of these models against hallucination when fine-tuned on smaller datasets. In this study, we introduce a new benchmar… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

  9. arXiv:2406.08445  [pdf, other

    eess.AS cs.LG cs.SD

    SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models

    Authors: Chun Yin, Tai-Shih Chi, Yu Tsao, Hsin-Min Wang

    Abstract: Representations from pre-trained speech foundation models (SFMs) have shown impressive performance in many downstream tasks. However, the potential benefits of incorporating pre-trained SFM representations into speaker voice similarity assessment have not been thoroughly investigated. In this paper, we propose SVSNet+, a model that integrates pre-trained SFM representations to improve performance… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to INTERSPEECH 2024

  10. arXiv:2406.01026  [pdf, other

    cs.CL

    Strengthened Symbol Binding Makes Large Language Models Reliable Multiple-Choice Selectors

    Authors: Mengge Xue, Zhenyu Hu, Liqun Liu, Kuo Liao, Shuang Li, Honglin Han, Meng Zhao, Chengguo Yin

    Abstract: Multiple-Choice Questions (MCQs) constitute a critical area of research in the study of Large Language Models (LLMs). Previous works have investigated the selection bias problem in MCQs within few-shot scenarios, in which the LLM's performance may be influenced by the presentation of answer choices, leaving the selection bias during Supervised Fine-Tuning (SFT) unexplored. In this paper, we reveal… ▽ More

    Submitted 6 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accept at ACL2024 Main

    Journal ref: ACL 2024

  11. arXiv:2405.19763  [pdf, other

    cs.CL

    Enhancing Reinforcement Learning with Label-Sensitive Reward for Natural Language Understanding

    Authors: Kuo Liao, Shuang Li, Meng Zhao, Liqun Liu, Mengge Xue, Zhenyu Hu, Honglin Han, Chengguo Yin

    Abstract: Recent strides in large language models (LLMs) have yielded remarkable performance, leveraging reinforcement learning from human feedback (RLHF) to significantly enhance generation and alignment capabilities. However, RLHF encounters numerous challenges, including the objective mismatch issue, leading to suboptimal performance in Natural Language Understanding (NLU) tasks. To address this limitati… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accept at ACL2024 Main

  12. arXiv:2405.18203  [pdf, other

    cs.CL

    IAPT: Instruction-Aware Prompt Tuning for Large Language Models

    Authors: Wei Zhu, Aaron Xuxiang Tian, Congrui Yin, Yuan Ni, Xiaoling Wang, Guotong Xie

    Abstract: Soft prompt tuning is a widely studied parameter-efficient fine-tuning method. However, it has a clear drawback: many soft tokens must be inserted into the input sequences to guarantee downstream performance. As a result, soft prompt tuning is less considered than Low-rank adaptation (LoRA) in the large language modeling (LLM) era. In this work, we propose a novel prompt tuning method, Instruction… ▽ More

    Submitted 7 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by ACL-2024

  13. arXiv:2405.11640  [pdf, other

    cs.AI cs.CL cs.CV

    Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning

    Authors: Zishan Gu, Fenglin Liu, Changchang Yin, Ping Zhang

    Abstract: The adoption of large language models (LLMs) in healthcare has attracted significant research interest. However, their performance in healthcare remains under-investigated and potentially limited, due to i) they lack rich domain-specific knowledge and medical reasoning skills; and ii) most state-of-the-art LLMs are unimodal, text-only models that cannot directly process multimodal inputs. To this… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  14. arXiv:2405.11597  [pdf, other

    cs.CL cs.AI

    Language Reconstruction with Brain Predictive Coding from fMRI Data

    Authors: Congchi Yin, Ziyi Ye, Piji Li

    Abstract: Many recent studies have shown that the perception of speech can be decoded from brain signals and subsequently reconstructed as continuous language. However, there is a lack of neurological basis for how the semantic information embedded within brain signals can be used more effectively to guide language reconstruction. The theory of predictive coding suggests that human brain naturally engages i… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

  15. arXiv:2405.03943  [pdf, other

    cs.LG cs.AI

    Predictive Modeling with Temporal Graphical Representation on Electronic Health Records

    Authors: Jiayuan Chen, Changchang Yin, Yuanlong Wang, Ping Zhang

    Abstract: Deep learning-based predictive models, leveraging Electronic Health Records (EHR), are receiving increasing attention in healthcare. An effective representation of a patient's EHR should hierarchically encompass both the temporal relationships between historical visits and medical events, and the inherent structural information within these elements. Existing patient representation methods can be… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: IJCAI 2024 main track

  16. arXiv:2403.20296  [pdf, other

    cs.IR

    Aiming at the Target: Filter Collaborative Information for Cross-Domain Recommendation

    Authors: Hanyu Li, Weizhi Ma, Peijie Sun, Jiayu Li, Cunxiang Yin, Yancheng He, Guoqiang Xu, Min Zhang, Shaoping Ma

    Abstract: Cross-domain recommender (CDR) systems aim to enhance the performance of the target domain by utilizing data from other related domains. However, irrelevant information from the source domain may instead degrade target domain performance, which is known as the negative transfer problem. There have been some attempts to address this problem, mostly by designing adaptive representations for overlapp… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted by SIGIR 2024

  17. arXiv:2403.16702  [pdf, other

    cs.CL cs.IR cs.SE

    ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search

    Authors: Zehan Li, Jianfei Zhang, Chuantao Yin, Yuanxin Ouyang, Wenge Rong

    Abstract: Retrieval-based code question answering seeks to match user queries in natural language to relevant code snippets. Previous approaches typically rely on pretraining models using crafted bi-modal and uni-modal datasets to align text and code representations. In this paper, we introduce ProCQA, a large-scale programming question answering dataset extracted from the StackOverflow community, offering… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted to LREC-COLING 2024

  18. arXiv:2402.15602  [pdf, other

    math.ST cs.IT cs.LG stat.ML

    Minimax Optimality of Score-based Diffusion Models: Beyond the Density Lower Bound Assumptions

    Authors: Kaihong Zhang, Caitlyn H. Yin, Feng Liang, Jingbo Liu

    Abstract: We study the asymptotic error of score-based diffusion model sampling in large-sample scenarios from a non-parametric statistics perspective. We show that a kernel-based score estimator achieves an optimal mean square error of $\widetilde{O}\left(n^{-1} t^{-\frac{d+2}{2}}(t^{\frac{d}{2}} \vee 1)\right)$ for the score function of $p_0*\mathcal{N}(0,t\boldsymbol{I}_d)$, where $n$ and $d$ represent t… ▽ More

    Submitted 23 July, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:60134-60178, 2024

  19. arXiv:2402.01330  [pdf, other

    cs.NI

    Video Semantic Communication with Major Object Extraction and Contextual Video Encoding

    Authors: Haopeng Li, Haonan Tong, Sihua Wang, Nuocheng Yang, Zhaohui Yang, Changchuan Yin

    Abstract: This paper studies an end-to-end video semantic communication system for massive communication. In the considered system, the transmitter must continuously send the video to the receiver to facilitate character reconstruction in immersive applications, such as interactive video conference. However, transmitting the original video information with substantial amounts of data poses a challenge to th… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: 6 pages, 9 figures, accepted by IEEE WCNC wksp 2024

  20. arXiv:2401.12079  [pdf, other

    cs.MA cs.LG

    Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking

    Authors: Yujiao Zhu, Mingzhe Chen, Sihua Wang, Ye Hu, Yuchen Liu, Changchuan Yin

    Abstract: In this paper, the problem of using one active unmanned aerial vehicle (UAV) and four passive UAVs to localize a 3D target UAV in real time is investigated. In the considered model, each passive UAV receives reflection signals from the target UAV, which are initially transmitted by the active UAV. The received reflection signals allow each passive UAV to estimate the signal transmission distance w… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  21. arXiv:2401.10153  [pdf, other

    cs.NI cs.CV

    Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving

    Authors: Jie Lv, Haonan Tong, Qiang Pan, Zhilong Zhang, Xinxin He, Tao Luo, Changchuan Yin

    Abstract: This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, pedestrians and obstacles) is more crucial than that of other objects to guarantee driving safety. Therefore, we propose a vehicular image segmentation-oriented semantic communication system, termed VIS-SemCom, where image seg… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 10 pages, 8 figures

  22. arXiv:2401.07329  [pdf, other

    cs.NE

    Attention-based UNet enabled Lightweight Image Semantic Communication System over Internet of Things

    Authors: Guoxin Ma, Haonan Tong, Nuocheng Yang, Changchuan Yin

    Abstract: This paper studies the problem of the lightweight image semantic communication system that is deployed on Internet of Things (IoT) devices. In the considered system model, devices must use semantic communication techniques to support user behavior recognition in ultimate video service with high data transmission efficiency. However, it is computationally expensive for IoT devices to deploy semanti… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

    Comments: 6 pages, 6 figures, accepted by IEEE WCNC 2024

  23. arXiv:2401.00137  [pdf, other

    cs.CR cs.CV

    SSL-OTA: Unveiling Backdoor Threats in Self-Supervised Learning for Object Detection

    Authors: Qiannan Wang, Changchun Yin, Lu Zhou, Liming Fang

    Abstract: The extensive adoption of Self-supervised learning(SSL) has led to an increased security threat from backdoor attacks. While existing research has mainly focused on backdoor attacks in image classification, there has been limited exploration of their implications for object detection. Object detection plays a critical role in security-sensitive applications, such as autonomous driving, where backd… ▽ More

    Submitted 12 June, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: 10 pages, 4figures

  24. arXiv:2312.13311  [pdf, other

    cs.LG eess.IV

    Unlocking Deep Learning: A BP-Free Approach for Parallel Block-Wise Training of Neural Networks

    Authors: Anzhe Cheng, Zhenkun Wang, Chenzhong Yin, Mingxi Cheng, Heng Ping, Xiongye Xiao, Shahin Nazarian, Paul Bogdan

    Abstract: Backpropagation (BP) has been a successful optimization technique for deep learning models. However, its limitations, such as backward- and update-locking, and its biological implausibility, hinder the concurrent updating of layers and do not mimic the local learning processes observed in the human brain. To address these issues, recent research has suggested using local error signals to asynchron… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: The paper has been accepted by ICASSP2024

  25. arXiv:2312.12723  [pdf, other

    cs.CV

    Multi-Clue Reasoning with Memory Augmentation for Knowledge-based Visual Question Answering

    Authors: Chengxiang Yin, Zhengping Che, Kun Wu, Zhiyuan Xu, Jian Tang

    Abstract: Visual Question Answering (VQA) has emerged as one of the most challenging tasks in artificial intelligence due to its multi-modal nature. However, most existing VQA methods are incapable of handling Knowledge-based Visual Question Answering (KB-VQA), which requires external knowledge beyond visible contents to answer questions about a given image. To address this issue, we propose a novel framewo… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  26. arXiv:2312.12721  [pdf, other

    cs.CV

    Cross-Modal Reasoning with Event Correlation for Video Question Answering

    Authors: Chengxiang Yin, Zhengping Che, Kun Wu, Zhiyuan Xu, Qinru Qiu, Jian Tang

    Abstract: Video Question Answering (VideoQA) is a very attractive and challenging research direction aiming to understand complex semantics of heterogeneous data from two domains, i.e., the spatio-temporal video content and the word sequence in question. Although various attention mechanisms have been utilized to manage contextualized representations by modeling intra- and inter-modal relationships of the t… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  27. arXiv:2312.12667  [pdf, other

    cs.CR cs.AI cs.LG

    Discovering Malicious Signatures in Software from Structural Interactions

    Authors: Chenzhong Yin, Hantang Zhang, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Xin Ren, Paul Bogdan

    Abstract: Malware represents a significant security concern in today's digital landscape, as it can destroy or disable operating systems, steal sensitive user information, and occupy valuable disk space. However, current malware detection methods, such as static-based and dynamic-based approaches, struggle to identify newly developed (``zero-day") malware and are limited by customized virtual machine (VM) e… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024, Accepted

  28. arXiv:2312.10987  [pdf, other

    cs.CL

    Cross-Subject Data Splitting for Brain-to-Text Decoding

    Authors: Congchi Yin, Qian Yu, Zhiwei Fang, Jie He, Changping Peng, Zhangang Lin, Jingping Shao, Piji Li

    Abstract: Recent major milestones have successfully decoded non-invasive brain signals (e.g. functional Magnetic Resonance Imaging (fMRI) and electroencephalogram (EEG)) into natural language. Despite the progress in model design, how to split the datasets for training, validating, and testing still remains a matter of debate. Most of the prior researches applied subject-specific data splitting, where the d… ▽ More

    Submitted 14 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  29. arXiv:2312.06330  [pdf, other

    cs.CV cs.AI cs.RO eess.IV

    Navigating Open Set Scenarios for Skeleton-based Action Recognition

    Authors: Kunyu Peng, Cheng Yin, Junwei Zheng, Ruiping Liu, David Schneider, Jiaming Zhang, Kailun Yang, M. Saquib Sarfraz, Rainer Stiefelhagen, Alina Roitberg

    Abstract: In real-world scenarios, human actions often fall outside the distribution of training data, making it crucial for models to recognize known actions and reject unknown ones. However, using pure skeleton data in such open-set conditions poses challenges due to the lack of visual background cues and the distinct sparse structure of body pose sequences. In this paper, we tackle the unexplored Open-Se… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI 2024. The benchmark, code, and models will be released at https://github.com/KPeng9510/OS-SAR

  30. arXiv:2311.08945  [pdf, other

    math.OC cs.DC cs.LG

    A Single-Loop Algorithm for Decentralized Bilevel Optimization

    Authors: Youran Dong, Shiqian Ma, Junfeng Yang, Chao Yin

    Abstract: Bilevel optimization has gained significant attention in recent years due to its broad applications in machine learning. This paper focuses on bilevel optimization in decentralized networks and proposes a novel single-loop algorithm for solving decentralized bilevel optimization with a strongly convex lower-level problem. Our approach is a fully single-loop method that approximates the hypergradie… ▽ More

    Submitted 23 April, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  31. arXiv:2311.04014  [pdf, other

    cs.AI math.OC

    A Method to Improve the Performance of Reinforcement Learning Based on the Y Operator for a Class of Stochastic Differential Equation-Based Child-Mother Systems

    Authors: Cheng Yin, Yi Chen

    Abstract: This paper introduces a novel operator, termed the Y operator, to elevate control performance in Actor-Critic(AC) based reinforcement learning for systems governed by stochastic differential equations(SDEs). The Y operator ingeniously integrates the stochasticity of a class of child-mother system into the Critic network's loss function, yielding substantial advancements in the control performance… ▽ More

    Submitted 1 January, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: 15 pages, 2 figures

  32. arXiv:2310.07885  [pdf, other

    cs.LG cs.AI

    Leader-Follower Neural Networks with Local Error Signals Inspired by Complex Collectives

    Authors: Chenzhong Yin, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Shahin Nazarian, Andrei Irimia, Paul Bogdan

    Abstract: The collective behavior of a network with heterogeneous, resource-limited information processing units (e.g., group of fish, flock of birds, or network of neurons) demonstrates high self-organization and complexity. These emergent properties arise from simple interaction rules where certain individuals can exhibit leadership-like behavior and influence the collective activity of the group. Motivat… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  33. arXiv:2310.00626  [pdf, other

    cs.CV cs.CR

    GhostEncoder: Stealthy Backdoor Attacks with Dynamic Triggers to Pre-trained Encoders in Self-supervised Learning

    Authors: Qiannan Wang, Changchun Yin, Zhe Liu, Liming Fang, Run Wang, Chenhao Lin

    Abstract: Within the realm of computer vision, self-supervised learning (SSL) pertains to training pre-trained image encoders utilizing a substantial quantity of unlabeled images. Pre-trained image encoders can serve as feature extractors, facilitating the construction of downstream classifiers for various tasks. However, the use of SSL has led to an increase in security research related to various backdoor… ▽ More

    Submitted 1 October, 2023; originally announced October 2023.

    Comments: 24 pages,8 figures

  34. arXiv:2309.12368  [pdf, other

    cs.HC cs.AI cs.LG

    Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis

    Authors: Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan Lu, Bingsheng Yao, Melanie Tory, Lace M. Padilla, Jeffrey Caterino, Ping Zhang, Dakuo Wang

    Abstract: Today's AI systems for medical decision support often succeed on benchmark datasets in research papers but fail in real-world deployment. This work focuses on the decision making of sepsis, an acute life-threatening systematic infection that requires an early diagnosis with high uncertainty from the clinician. Our aim is to explore the design requirements for AI systems that can support clinical e… ▽ More

    Submitted 26 February, 2024; v1 submitted 17 September, 2023; originally announced September 2023.

    Comments: Accepted by CHI'24

    MSC Class: 68U35 ACM Class: H.5.2; I.2.1

  35. arXiv:2309.10305  [pdf, other

    cs.CL

    Baichuan 2: Open Large-scale Language Models

    Authors: Aiyuan Yang, Bin Xiao, Bingning Wang, Borong Zhang, Ce Bian, Chao Yin, Chenxu Lv, Da Pan, Dian Wang, Dong Yan, Fan Yang, Fei Deng, Feng Wang, Feng Liu, Guangwei Ai, Guosheng Dong, Haizhou Zhao, Hang Xu, Haoze Sun, Hongda Zhang, Hui Liu, Jiaming Ji, Jian Xie, JunTao Dai, Kun Fang , et al. (30 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on a variety of natural language tasks based on just a few examples of natural language instructions, reducing the need for extensive feature engineering. However, most powerful LLMs are closed-source or limited in their capability for languages other than English. In this technical report, we present Baichuan 2, a series of lar… ▽ More

    Submitted 20 September, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: Baichuan 2 technical report. Github: https://github.com/baichuan-inc/Baichuan2

  36. arXiv:2308.13259  [pdf, other

    cs.CL cs.AI

    Knowledge-Driven CoT: Exploring Faithful Reasoning in LLMs for Knowledge-intensive Question Answering

    Authors: Keheng Wang, Feiyu Duan, Sirui Wang, Peiguang Li, Yunsen Xian, Chuantao Yin, Wenge Rong, Zhang Xiong

    Abstract: Equipped with Chain-of-Thought (CoT), Large language models (LLMs) have shown impressive reasoning ability in various downstream tasks. Even so, suffering from hallucinations and the inability to access external knowledge, LLMs often come with incorrect or unfaithful intermediate reasoning steps, especially in the context of answering knowledge-intensive tasks such as KBQA. To alleviate this issue… ▽ More

    Submitted 28 October, 2023; v1 submitted 25 August, 2023; originally announced August 2023.

  37. arXiv:2308.04673  [pdf, other

    cs.CR cs.AI

    SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning

    Authors: Xiaobei Li, Changchun Yin, Liyue Zhu, Xiaogang Xu, Liming Fang, Run Wang, Chenhao Lin

    Abstract: Self-supervised learning (SSL), a paradigm harnessing unlabeled datasets to train robust encoders, has recently witnessed substantial success. These encoders serve as pivotal feature extractors for downstream tasks, demanding significant computational resources. Nevertheless, recent studies have shed light on vulnerabilities in pre-trained encoders, including backdoor and adversarial threats. Safe… ▽ More

    Submitted 6 December, 2023; v1 submitted 8 August, 2023; originally announced August 2023.

  38. arXiv:2308.00531  [pdf, ps, other

    cs.NI

    Adaptive Bitrate Video Semantic Communication over Wireless Networks

    Authors: Wentao Gong, Haonan Tong, Sihua Wang, Zhaohui Yang, Xinxin He, Changchuan Yin

    Abstract: This paper investigates the adaptive bitrate (ABR) video semantic communication over wireless networks. In the considered model, video sensing devices must transmit video semantic information to an edge server, to facilitate ubiquitous video sensing services such as road environment monitoring at the edge server in autonomous driving scenario. However, due to the varying wireless network condition… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  39. arXiv:2305.18514  [pdf, other

    quant-ph cond-mat.stat-mech cs.CC math-ph

    Polynomial-time classical sampling of high-temperature quantum Gibbs states

    Authors: Chao Yin, Andrew Lucas

    Abstract: The computational complexity of simulating quantum many-body systems generally scales exponentially with the number of particles. This enormous computational cost prohibits first principles simulations of many important problems throughout science, ranging from simulating quantum chemistry to discovering the thermodynamic phase diagram of quantum materials or high-density neutron stars. We present… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 4+4 pages; 0+1 figure

  40. arXiv:2305.11916  [pdf, other

    cs.CL

    F-PABEE: Flexible-patience-based Early Exiting for Single-label and Multi-label text Classification Tasks

    Authors: Xiangxiang Gao, Wei Zhu, Jiasheng Gao, Congrui Yin

    Abstract: Computational complexity and overthinking problems have become the bottlenecks for pre-training language models (PLMs) with millions or even trillions of parameters. A Flexible-Patience-Based Early Exiting method (F-PABEE) has been proposed to alleviate the problems mentioned above for single-label classification (SLC) and multi-label classification (MLC) tasks. F-PABEE makes predictions at the cl… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: accepted by ICASSP-2023

  41. arXiv:2305.08353  [pdf, other

    cs.DS cs.LG

    Fast and Efficient Matching Algorithm with Deadline Instances

    Authors: Zhao Song, Weixin Wang, Chenbo Yin, Junze Yin

    Abstract: The online weighted matching problem is a fundamental problem in machine learning due to its numerous applications. Despite many efforts in this area, existing algorithms are either too slow or don't take $\mathrm{deadline}$ (the longest time a node can be matched) into account. In this paper, we introduce a market model with $\mathrm{deadline}$ first. Next, we present our two optimized algorithms… ▽ More

    Submitted 12 February, 2024; v1 submitted 15 May, 2023; originally announced May 2023.

  42. arXiv:2303.07537  [pdf, other

    cs.LG q-bio.QM

    Fractional dynamics foster deep learning of COPD stage prediction

    Authors: Chenzhong Yin, Mihai Udrescu, Gaurav Gupta, Mingxi Cheng, Andrei Lihu, Lucretia Udrescu, Paul Bogdan, David M Mannino, Stefan Mihaicuta

    Abstract: Chronic obstructive pulmonary disease (COPD) is one of the leading causes of death worldwide. Current COPD diagnosis (i.e., spirometry) could be unreliable because the test depends on an adequate effort from the tester and testee. Moreover, the early diagnosis of COPD is challenging. We address COPD detection by constructing two novel physiological signals datasets (4432 records from 54 patients i… ▽ More

    Submitted 13 March, 2023; originally announced March 2023.

    Comments: Published on Advanced Science

  43. arXiv:2303.06962  [pdf, other

    cs.IT eess.SP

    A Novel Two-Layer Codebook Based Near-Field Beam Training for Intelligent Reflecting Surface

    Authors: Tao Wang, Jie Lv, Haonan Tong, Changsheng You, Changchuan Yin

    Abstract: In this paper, we study the codebook-based near-field beam training for intelligent reflecting surfaces (IRSs) aided wireless system. In the considered model, the near-field beam training is critical to focus signals at the location of user equipment (UE) to obtain prominent IRS array gain. However, existing codebook schemes cannot achieve low training overhead and high receiving power simultaneou… ▽ More

    Submitted 18 April, 2023; v1 submitted 13 March, 2023; originally announced March 2023.

    Comments: 6 pages, 4 figures

  44. arXiv:2303.06747  [pdf, other

    cs.CV eess.IV

    Raising The Limit Of Image Rescaling Using Auxiliary Encoding

    Authors: Chenzhong Yin, Zhihong Pan, Xin Zhou, Le Kang, Paul Bogdan

    Abstract: Normalizing flow models using invertible neural networks (INN) have been widely investigated for successful generative image super-resolution (SR) by learning the transformation between the normal distribution of latent variable $z$ and the conditional distribution of high-resolution (HR) images gave a low-resolution (LR) input. Recently, image rescaling models like IRN utilize the bidirectional n… ▽ More

    Submitted 12 March, 2023; originally announced March 2023.

  45. arXiv:2303.02304  [pdf, other

    cs.LG

    Coupled Multiwavelet Neural Operator Learning for Coupled Partial Differential Equations

    Authors: Xiongye Xiao, Defu Cao, Ruochen Yang, Gaurav Gupta, Gengshuo Liu, Chenzhong Yin, Radu Balan, Paul Bogdan

    Abstract: Coupled partial differential equations (PDEs) are key tasks in modeling the complex dynamics of many physical processes. Recently, neural operators have shown the ability to solve PDEs by learning the integral kernel directly in Fourier/Wavelet space, so the difficulty for solving the coupled PDEs depends on dealing with the coupled mappings between the functions. Towards this end, we propose a \t… ▽ More

    Submitted 8 December, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  46. CTRLStruct: Dialogue Structure Learning for Open-Domain Response Generation

    Authors: Congchi Yin, Piji Li, Zhaochun Ren

    Abstract: Dialogue structure discovery is essential in dialogue generation. Well-structured topic flow can leverage background information and predict future topics to help generate controllable and explainable responses. However, most previous work focused on dialogue structure learning in task-oriented dialogue other than open-domain dialogue which is more complicated and challenging. In this paper, we pr… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: 12 pages, to be published in The Web Conference 2023

  47. arXiv:2302.14648  [pdf, other

    cs.IT cs.AI cs.LG

    Digital Over-the-Air Federated Learning in Multi-Antenna Systems

    Authors: Sihua Wang, Mingzhe Chen, Cong Shen, Changchuan Yin, Christopher G. Brinton

    Abstract: In this paper, the performance optimization of federated learning (FL), when deployed over a realistic wireless multiple-input multiple-output (MIMO) communication system with digital modulation and over-the-air computation (AirComp) is studied. In particular, a MIMO system is considered in which edge devices transmit their local FL models (trained using their locally collected data) to a paramete… ▽ More

    Submitted 25 April, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

  48. arXiv:2302.05406  [pdf, other

    cs.CL

    Adversarial Transformer Language Models for Contextual Commonsense Inference

    Authors: Pedro Colon-Hernandez, Henry Lieberman, Yida Xin, Claire Yin, Cynthia Breazeal, Peter Chin

    Abstract: Contextualized or discourse aware commonsense inference is the task of generating coherent commonsense assertions (i.e., facts) from a given story, and a particular sentence from that story. Some problems with the task are: lack of controllability for topics of the inferred facts; lack of commonsense knowledge during training; and, possibly, hallucinated or false facts. In this work, we utilize a… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

    Comments: Submitted to Semantic Web Journal special edition. https://semantic-web-journal.org/content/adversarial-transformer-language-models-contextual-commonsense-inference-1

  49. arXiv:2301.12833  [pdf, other

    cs.IT

    Sum-Rate Maximization for Active RIS-Aided Downlink RSMA System

    Authors: Xinhao Li, Tao Wang, Haonan Tong, Zhaohui Yang, Yijie Mao, Changchuan Yin

    Abstract: In this paper, the problem of sum-rate maximization for an active reconfigurable intelligent surface (RIS) assisted downlink rate-splitting multiple access (RSMA) transmission system is studied. In the considered model, the active RIS is deployed to overcome severe power attenuation, which is caused by the cumulative product of RIS incidence path loss and the reflection path loss. Since the active… ▽ More

    Submitted 30 January, 2023; originally announced January 2023.

  50. arXiv:2211.15118  [pdf, other

    cs.DS cs.LG

    A Faster $k$-means++ Algorithm

    Authors: Jiehao Liang, Somdeb Sarkhel, Zhao Song, Chenbo Yin, Junze Yin, Danyang Zhuo

    Abstract: $k$-means++ is an important algorithm for choosing initial cluster centers for the $k$-means clustering algorithm. In this work, we present a new algorithm that can solve the $k$-means++ problem with nearly optimal running time. Given $n$ data points in $\mathbb{R}^d$, the current state-of-the-art algorithm runs in $\widetilde{O}(k )$ iterations, and each iteration takes $\widetilde{O}(nd k)… ▽ More

    Submitted 13 February, 2024; v1 submitted 28 November, 2022; originally announced November 2022.